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I.  Overview 


In  the  past  30  years,  fiber  optic  communications  and  optical  networks  have  progressed 
from  a  laboratory  concept,  to  a  multi-billion  dollar  telecommunications  industry  and 
recently  on  to  form  the  backbone  technology  of  the  next  generation  Internet.  The 
successful  transmission  of  multiple  wavelengths  on  a  single  fiber  coupled  with  robust 
optical  amplifiers  and  dispersion  compensation  has  enabled  transmission  of  high-speed 
signals  over  thousands  of  kilometers  without  converting  the  signal  back  to  electronic 
form.  Single  wavelength  transmission  rates  have  increased  from  50  Mbit/s  18  years  ago 
to  40  Gigabits/sec  (Gbps)  in  2001.  Commercial  systems  with  40-80  wavelengths 
operating  at  10  Gbps  per  wavelength  are  being  installed  and  systems  capable  of 
delivering  10  Gbps  on  each  of  160  wavelengths  have  been  commercially  demonstrated. 
Ultra-high-Capacity  transmission  systems  with  greater  than  10  Terabits/sec  over  a  single 
fiber  have  been  demonstrated  in  the  laboratory  (1  Tbps  =  1012  bits/sec). 

We  have  entered  a  new  communications  age  that  increasingly  depends  on  the 
transmission  and  dissemination  of  Internet  traffic.  As  the  world  moves  to  upgrade  the 
existing  communications  network  to  accommodate  this  change,  the  exponentially 
increasing  demand  for  packet-based  traffic  has  outstripped  traditional  voice  traffic  as 
illustrated  in  Figure  1.  The  rapid  deployment  of  DSL  and  cable  modem  technologies 
coupled  with  real-time  applications  like  video  streaming  will  mean  that  an  additional  100 
million  homes  will  require  Internet  access  via  100  Mbit/s  to  Gigabit  Ethernet  connections 
per  home. 


Figure  1.  Voice  vs.  Internet  traffic  growth  for  Greenfield  Networks. 

[Source:  “Greenfield  Optical  Switched  Transport  Networks:  A  Cost  Analysis,”  C.R. 
Lima,  M.Allen  and  B.Faer,  Valiant  Networks,  Inc.,  NFOEC  Technical  Proceedings,  pp. 

1449-1457  (2001).] 
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A  Roadmap  to  Next  Generation  Optical  Networks 

The  current  bandwidth  explosion  is  placing  demands  on  the  network  that  moves  beyond 
the  transmission  bandwidth  and  into  the  switching  and  routing  capacity.  It  has  become 
costly  to  switch  and  route  the  complete  fiber  transmission  bandwidth  using  electronic 
technology  alone,  especially  as  wavelength  counts  exceed  40  and  bit-rates  climb  to  10 
Gbps  and  beyond.  The  next  generation  of  fiber-optic  networks  will  see  a  new  partnership 
between  optical  switching  and  advance  electronic  technologies,  where  data  is  switched  in 
the  optical  domain  until  it  is  necessary  to  interact  with  individual  bits  using  electronics. 
Optical  switching  will  undergo  an  evolution  that  mirrors  advances  in  technology.  The 
agility  with  which  optical  signals  can  be  reconfigured  has  moved  from  static  environment 
to  a  more  dynamic  optical  circuit  switched  environment  using  reconfigurable  photonic 
crossconnects  (PXCs)  and  optical  add/drop  multiplexers  (OADMs)  as  illustrated  in  the 
roadmap  in  Figure  2.  Research  today  has  moved  to  higher  degrees  of  agility  where 
signals  are  regenerated  and  converted  to  new  wavelengths  all-optically.  Switching 
technologies  go  beyond  circuits  into  switching  bursts  of  data  and  optical  packet 
switching. 
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Figure  2.  Evolutionary  roadmap  of  optical  networks. 

[Source:  D.  J.  Blumenthal,  OEDA  Workshop  on  Wavelength- Agile  Photonic  Systems, 

Rhode  Island,  October  18-19  (2001)] 
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MOST  Multidisciplinary  Research 

The  MOST  Center  at  UCSB  is  a  broad  multidisciplinary  research  program  sponsored  by 
the  DoD  Multidisciplinary  University  Research  Initiative  (MURI)  program.  Over  the  5- 
year  lifetime  of  the  center,  a  broad  range  of  research  topics  related  to  optical  switching, 
the  underlying  technologies  and  relevant  network  topics  were  supported  and  investigated. 
Photonic  transmission  and  device  researchers  worked  closely  with  protocol  researchers 
and  electrical  integrated  circuit  designers.  The  high-performance  requirements  of  optical 
switching  systems  and  components  required  the  close  involvement  of  growth  and 
processing  specialists.  The  MOST  Center  and  its  research  components  over  the  Center 
lifetime  is  illustrated  in  Figure  3. 


Optical 

°Ptical  Switch 

Protocols  Architectures 


Figure  3.  MOST  Center  multidisciplinary  vision. 


This  publication  represents  the  collective  work  of  multiple  faculty  at  UCSB  and  their 
students  and  collaborators  over  the  5  year  period  of  the  MURI  award.  We  have 
assembled  a  representative  collection  of  publications  and  MURI  reviews  as  well  as  an 
overview  of  personnel  supported,  graduated  students,  facilities  and  technology  transfer. 
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HI.  Group  Summaries 


3D  Photonic  Circuits  and  Active  Devices  for  OTDM  Systems 


John  E.  Bowers 

We  have  developed  different  elements  of  three-dimensional  (3D)  photonic  circuits  for 
sophisticated  switching  applications.  Wafer  bonding  is  the  enabling  technology  that 
allows  the  fabrication  of  complex  3D  photonic  waveguide  structures  not  realizable  with 
any  other  approach.  Figure  1  shows  a  schematic  view  of  a  wafer-bonded  3D  photonic 
circuit.  By  vertical  waveguide  coupling  across  the  fused  interface,  we  achieve  high 
coupling  efficiency  at  short  coupling  lengths.  By  repeatedly  etching  waveguides  and 
fusing  additional  planar  layers,  multilevel  interconnects  are  fabricated.  Several  novel  3D 
routing  waveguide  devices  have  been  created  from  InP/InGaAsP  or  GaAs/AlGaAs: 


Fig.  1 :  Schematic  view  of  a  three-dimensional  photonic  circuit 


(1)  By  anti-phase  fusing  two  waveguide  structures,  the  anisotropy  of  the  linear 
electrooptic  effect  in  zincblende  crystals  was  used  to  produce  an  opposite  index 
change  in  two  coupled  waveguides  with  only  two  contacts.  A  push-pull  fused  vertical 
coupler  switch  using  this  effect  was  realized  and  switched  under  12  V  reverse  bias 
(Fig.  2). 

(2)  A  4-channel  wavelength  multiplexer/demultiplexer  was  also  built  by  combining  3D 
vertical  couplers.  For  this  device  17  nm  channel  spacing  with  crosstalk  less  than  20 
dB  was  achieved. 
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Fig.  2:  Push-pull  wafer-bonded  vertical  coupler  switch.  Top:  design,  bottom:  performance. 
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Fig.  3:  Three-layer  wafer-bonded  1:8  beam  splitter. 

(3)  A  novel  three-layer  double-bonded  InP-InGaAsP  waveguide  vertical  coupler  1:8 
beam  splitter  was  demonstrated  (Fig.  3).  The  strongly  coupled  waveguides  allow  a 
583-micron  device  length,  more  than  one  hundred  times  shorter  than  that  of  the 
equivalent  horizontal  coupler.  The  device  illustrates  the  use  of  multiple  vertical  layer 
optical  interconnects  for  three-dimensional  routing  of  optical  signals. 
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(4)  Multi-channel  vertically  coupled  InP/InGaAsP  optical  add-drop  multiplexers 
(OADMs)  were  fabricated  for  wavelength  division  multiplexing  (WDM)  applications 
(Fig.  4). 


Fig.  4:  InGaAsP/InP  optical  add-drop  multiplexer. 


180  pF  50  Q  tandem 
di-cap  resistor  EAM 


Fig.  5:  Integrated  Tandem  Electroabsorption  Modulator. 

Another  field  of  our  research  was  the  development  of  low  voltage,  high  extinction  ratio 
and  large  bandwidth  traveling  wave  electroabsorption  modulators  (TWEAMs)  and  their 
application  in  optical  time  division  multiplexing  (OTDM)  systems.  We  demonstrated 
TWEAMs  with  30  GHz  bandwidth  and  1.2  V  switching  voltage  (Fig.  5).  A  four-channel 
120  Gbit/s  OTDM  system  was  build  that  reduces  the  number  of  optical  channels  by 
increasing  the  base  rate  to  30  Gbit/s  (Fig.  6).  Integrated  tandem  modulators  were  used  to 
(1)  pulse  shape  and  modulate  data,  and  (2)  demodulate  and  detect  data.  This  experiment 
was  conducted  at  4  x  30  Gbit/s  because  that  was  the  fastest  electrical  multiplexer  that  was 
available.  The  technology  is  certainly  usable  to  achieve  160  Gbit/s  and  probably  320 
Gbit/s.  The  integration  of  tandem  devices  is  important  to  reduce  the  coupling  loss,  and  to 
allow  more  complex  processing  than  simple  modulation  or  demultiplexing.  In  another 
experiment,  simultaneous  TDM  drop  and  detect  was  demonstrated.  A  single  device  was 
used  to  drop  one  TDM  channel  and  pass  the  other  three  channels.  The  absorption  of  the 
one  channel  generates  a  photocurrent  that  can  be  monitored  to  receive  the  incoming  data. 
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This  approach  is  much  simpler  than  having  a  fast  1x2  switch  followed  by  a  fast 
photodetector. 


30  GHz 


13.3  ps/div 


Fig.  7:  Left:  Eye  diagrams  (13.3ps/div,  limited  by  40  GHz  photodetector  &  7.5  GHz 
scope  trigger).  Right:  Bit  error  rate  measurement. 


Large-Scale  Wavelength  and  Space  Switching 
using  Widelv-Tunable  Lasers 


Larry  A.  Coldren 

In  this  project  the  photonic  IC  technology  previously  developed  at  UCSB  for  widely  tunable 
lasers  was  applied  to  two  new  efforts:  an  all-optical  cross-bar  switch,  and  an  optoelectronic 
wavelength  converter.  The  first  effort  resulted  in  4  x  4  optical  cross  bar  switches  with 
approximately  zero  dB  insertion  loss  and  a  40  dB  on/off  and  interchannel  crosstalk  level.  This 
was  all  accomplished  on  a  single  chip  of  InP  measuring  less  than  2mm  on  a  side.  These  results 
represent  the  state  of  the  art  in  such  switches. 

The  crossbar  switch  fabric  consisted  of  four  input  and  four  output  waveguides,  spaced  by  250 
microns,  which  intersected  at  right  angles.  At  the  intersection  a  pair  of  novel  1x2  suppressed 
modal  interference  (SMI)  switches  on  the  both  the  input  and  output  guides  connected  a  bypass 
semiconductor  optical  amplifier  (SO A)  that  could  be  turned  on  simultaneously  with  the  SMIs  to 
provide  switching  from  the  bar  to  the  cross  state.  These  SOAs  provided  good  isolation  in  the  off 
(or  bar)  state  and  loss  compensation  in  the  on  (or  cross)  state.  SOAs  and  the  inputs  to  and  outputs 
from  the  chip  compensated  for  fiber  coupling  losses. 

In  the  latter  years  of  this  MOST  project,  this  same  photonic  integration  platform  was  used  to  form 
widely-tunable  wavelength  converters.  In  this  case,  an  integrated  O/E/O  approach  is  taken  using 
technology  compatible  with  the  formation  of  the  Sampled-Grating  DBR  (SGDBR)  lasers.  It  is 
proposed  that  these  wavelength  converters  can  then  be  integrated  with  compact  wavelength 
routing  elements,  such  as  Arrayed-Waveguide  (AWG)  routers,  to  create  a  space  switch. 
Additional  wavelength  converters  may  be  desired  at  the  output  to  convert  the  wavelength  to  some 
desired  channel. 

Modelmg  indicates  that  efficient,  format-independent  wavelength  conversion,  and  therefore 
switching,  can  be  accomplished  up  to  data  rates  of  40Gb/s  with  devices  occupying  an  area  of 
about  1  mm  .  For  low  data  rates  (<10  Gb/s)  direct  modulation  of  the  SGDBR  is  possible,  and 
lossless  conversion  should  be  possible  by  directly  wiring  the  photocurrent  of  a  photodiode  or 
APD  to  the  gain  region  of  an  SGDBR  which  includes  an  integrated  SOA  post-amplifier.  For 
higher  frequencies,  an  integrated  modulator  is  required,  but  this  also  generally  requires  some 
form  of  electronic  amplification  to  obtain  lossless  operation.  An  integrated  FET  may  suffice,  but 
flip-chip  bonding  of  the  InP  photonic  chip  to  some  electronic  chip  may  be  desired  in  this  case. 
With  such  electronic  interfaces,  2R  and  3R  regeneration  functions  may  also  be  incorporated. 

Initial  experiments  have  provided  SGDBR  lasers  with  integrated  SOAs  and  EAMs  which 
function  according  to  design.  CW  output  powers  exceeding  30  mW  has  been  observed,  and 
modulation  bandwidths  for  both  direct  modulation  of  the  gain  section  and  external  modulation  of 
the  integrated  EAM  have  provided  bandwidths  >  5  GHz.  Some  of  this  work  was  done  in 
collaboration  with  Agility  Communications.  Additional  work  on  packaging  and  testing  of  these 
devices  continues.  Avalanche  photodetectors  have  also  be  fabricated  and  tested  in  collaboration 
with  Prof.  J.  C.  Campbell  of  UT-Austin.  Using  his  narrow  avalanche  region  design,  APDs  with 
reasonable  gain  (>  10)  with  low  excess  noise  factors  (<  3)  have  been  fabricated  and  measured. 
Masks  have  been  designed  to  integrate  detectors  with  the  widely-tunable  lasers,  and  further  work 
is  underway  with  additional  support  from  industrial  collaborators. 
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Fig.  1:  Monolithic  Wavelength  Converter 
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Fig.  2  Crossbar  Space  Switch 
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Compact,  low-loss,  electronically  tunable  filters  based  on 
hybrid  integration  of  polymers  and  compound  semiconHnrtnrg 

Nadir  Dagli 


TxmaWc  filtering  is  a  key  technology  for  the  successful  implementation  of  WDM  networks  It  is 
difficult  to  satisfy  all  the  requirements  of  tunable  WDM  filtering,  such  as  low  insertion  loss,  low 
cost,  low  crosstalk,  wide  and  fast  tunability  using  a  single  technology.  In  our  work  we  utilize 
hybrid  integration  of  polymers  and  compound  semiconductors  to  realize  WDM  components  with 
superior  characteristics.  Polymers  are  used  for  low  propagation  and  fiber  coupling  loss 
Compound  semiconductors  are  used  for  high  contrast  gratings  and  electro  optic  properties  needed 
technology^  ^  tUnabiHty‘  We  fabricated  and  characterized  two  types  of  filters  using  this 

The  first  one  relies  on  the  forward  coupling  between  a  polymer  and  a  semiconductor  waveguide 

T;  S“8!f  D^^egUide  1S  fabricated  us“g  Polymers  Benzocyclobutene  (BOB), 
photo  definable  BCB  (PDBCB)  and  Polymethylglutarimide  (PMGI).  A  separately  fabricated 
GaAs  waveguide  was  glued  on  the  polymer  waveguide  using  one  of  the  polymers  as  glue. 

inally  the  growth  substrate  for  the  GaAs  waveguide  was  removed  by  substrate  etching  forming 
a  coupled  structure  GaAs  and  polymer  waveguides  have  very  different  indices  and  index 
ispersion.  Therefore,  when  coupled  they  act  as  a  veiy  asymmetrical  coupler.  Such 
asymmetrical  couplers  behave  as  narrow-band  filters.  In  our  approach  input  and  output  coupling 
to  single  mode  fibers  is  done  through  the  single  mode  polymer  waveguide.  This  results  in  a  band 
stop  fiber  Furthermore  fiber  to  fiber  coupling  loss  can  be  low.  In  our  experiments  we  observe 
ess  than  6  dB  fiber  to  fiber  insertion  loss,  which  can  be  further  improved  by  optimizing  the  mode 
shape  of  the  polymer  waveguide.  By  changing  the  electrical  thickness  (the  product  of  the 
physical  thickness  and  the  mdex)  of  the  GaAs  waveguide  coupling  wavelength  can  be  tuned.  In 
our  experiments  we  observe  about  2  nm  bandwidth  with  more  than  15  dB  suppression  in  a  3  mm 
ong  device.  We  were  also  able  to  tune  the  center  wavelength  by  more  than  50  nm  by  changing 

the  thickness  of  the  GaAs  waveguide.  We  also  observe  fast  electrical  tuning  in  addition  to  tuning 
based  on  etching.  B 

The  second  type  filter  based  on  the  hybrid  integration  of  polymers  and  compound  semiconductors 
is  very  similar  to  the  previous  one.  In  this  case  a  grating  on  the  GaAs  slab  waveguide  was 
introduced.  Reflectivity  of  a  grating  in  between  two  coupled  dissimilar  waveguides  can  be 
changed  drastically  by  changing  the  electrical  thickness  of  one  of  the  waveguides.  The 
reflectivity  change  is  due  to  the  modification  of  electrical  field  strength  on  the  grating.  In  the 
transverse  plane  the  field  profile  of  dissimilar  coupled  waveguides  is  very  similar  to  that  of  a 
abiy  Perot,  hence  is  very  wavelength  sensitive.  As  a  result  by  changing  the  electrical  thickness 
0  e  GaAs  waveguide  as  in  the  previous  case,  field  strength  on  the  grating  can  be  modified 
strongly.  This  in  turn  modifies  the  reverse  coupling  coefficient  of  the  grating.  Hence  the 

rjj/jtlVlt^  °i  ?\e  £TatblS  can  be  fumed  on  and  off.  Such  a  reflective  device  can  work  as  an 
add/drop  multiplexer  when  combined  with  two  circulators  at  the  input  and  output  We 
demonstrated  turning  a  grating  on  and  off  this  way  by  changing  the  thickness  of  the  GaAs 

waveguide  for  a  3  mm  long  device.  Fast  electrical  tuning  should  also  be  possible  just  like  in  the 
previous  case.  J 

The  components  demonstrated  in  this  work  have  the  potential  of  being  the  building  blocks  of  compact,  low- 
loss  and  fast  tunable  filters  that  can  be  utilized  in  optical  networks. 
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Fig.  1:  Hybrid  Waveguide  Structure 
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Fig.  2:  Transmission  characteristics  at  different  bias. 
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Optical  Packet  Switching, 

Label  Swapping  and  Wavelength  Conversion  Technologies 

Dan  Blumenthal 


All-optical  label  swapping  is  a  promising  approach  to  ultra-high  packet-rate  routing  and 
forwarding  directly  in  the  optical  layer.  The  overall  network  approach  is  to  encapsulate 
packets  with  optical  labels  and  process  forwarding  and  routing  functions  independent  of 
packet  bit  rate  and  format.  We  have  experimentally  demonstrated  results  various 
components  and  label  swapping  functions  and  demonstration  of  40  Gbit/s  optical  label 
swappmg.  To  allow  operation  at  ultra-high  bit  rates  we  have  demonstrated  a  new 
wavelength  converter  based  on  nonlinearities  in  an  optical  fiber  that  supports  operation  at 
40  Gbit/s  and  beyond  (possibly  up  to  several  hundered  Gbit/s).  In  this  scheme  we  utilize 
XPM  in  a  dispersion  shifted  fiber  imposed  from  the  incoming  data  onto  CW  light, 
followed  by  conversion  of  the  phase  modulation  to  amplitude  modulation  using  optical 
filtering.  The  incoming  data  is  combined  with  a  CW  light  signal  and  sent  through  an 
optical  fiber  where  the  data  imposes  a  phase  modulation  onto  the  CW  light  via  fiber 
XPM.  This  phase  modulation  generates  optical  sidebands  on  the  CW  signal,  which  are 
converted  to  amplitude  modulation  by  suppressing  the  original  CW  carrier  using  an 
optical  notch  filter.  A  conventional  band-pass  filter  to  select  one  of  the  two  sidebands 
then  follows  the  notch  filter.  However,  in  principle  it  is  sufficient  to  use  a  single  band 
pass  filter,  e.g.  a  fiber  Bragg-grating  or  fiber-loop  mirror,  provided  that  it  can  suppress 
the  original  CW  carrier  enough.  This  wavelength  converter  also  erases  2.5  Gbps  NRZ 
header  data.  In  order  to  perform  very  high  speed  optical  multiplexing  and  demultiplexing 
we  collaborated  with  John  Bowers  group  with  respect  to  using  electroabsorption 
modulators  (EAMs). 

We  have  also  investigated  optical  switching  architectures  that  utilize  Wavelength 
Conversion  (WC)  that  can  be  tuned  at  very  high  bit  rates.  We  have  demonstrated  that  the 
same  rapidly  tunable  laser  can  be  used  as  a  rapidly  tunable  WC.  Dynamic  conversion  of 
any  wavelength  to  any  wavelength  using  a  single  GCSR  laser,  where  the  output 
wavelength  is  set  by  rapidly  changing  the  current  in  one  of  the  laser  sections  was 
reported.  Wavelength  conversion  in  this  case  is  achieved  through  the  gain  suppression  of 
the  lasing  mode  by  the  injected  signal  light.  Among  the  most  important  advantages  of  this 
technique  are  the  modulation  format  transparency,  tunability  of  the  converted  signal 

wavelength,  high  conversion  efficiency  and  extinction  ratio  and  single  element 
simplicity. 

Optical  label  swapping  and  wavelength  conversion  switching  technologies  can  be 
combined  to  realize  the  optical  equivalent  of  an  IP  router.  The  Optical  Internet  Protocol 
Router  (OIPR)  project  is  aimed  at  investigating  possible  these  approaches.  When 
combining  optical  packet  switching  technologies  into  a  single  router,  new  issues  arise 
with  packet  buffering  and  switch  blocking.  This  year  we  collaborated  with  J.  Lang  and  E. 
Varvarigos  to  investigate  an  all-optical  packet  wavelength  scheduler/switch.  The 
scheduler  utilizes  wavelength  conversion  and  switching  technology  to  collapse  a  large 
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number  of  programmable  optical  delays  efficiently.  Scalability  of  this  approach  depends 
on  the  ability  to  integrate  many  photonic  devices  on  a  single  chip. 


input  40  Gbit/s  data 


Figure  1.  Experimental  set-up  of  the  ultra-fast  all-optical  wavelength  converter 


Wavelength  converter  and  optical  packet  routing  technology 

The  main  contribution  in  this  project  has  been  to  develop  a  novel  ultra  high-speed  all- 
optical  wavelength  converter,  capable  of  operating  at  40  Gbit/s  and  beyond  [1].  The  basic 
idea  is  to  combine  the  incoming  data  with  continuous  wave  (CW)  light  at  the  desired 
output  wavelength  and  let  them  co  propagate  through  an  optical  fiber  with  high  power. 
The  original  data  will  impose  a  phase  modulation  via  cross  phase  modulation  (XPM)  on 
the  CW  light  according  to  the  incoming  data.  A  filter  arrangement  after  the  fiber  is  used 
to  convert  the  phase  modulation  into  amplitude  modulation  at  the  new  wavelength. 
Figure  1  shows  a  typical  set-up  and  eye  patterns  demonstrate  wavelength  conversion  of 
40  Gbit/s  data.  To  better  understand  the  physics  of  the  wavelength  converter,  extensive 
research  was  performed,  e.g.  by  analyzing  the  pulse  response  and  bandwidth  [2]  as  well 
as  investigating  noise  limitations  in  the  conversion  process  [3].  The  wavelength 
conversion  efficiency  depends  heavily  on  the  derivative  of  the  input  signal  and  thus  only 
return  to  zero  (RZ)  data  can  be  converted.  This  effect  allows  the  converter  to  be  used  in 
packet  routing  systems  using  all-optical  label  swapping  (AOLS)  [4]  if  the  payload  is 
coded  using  high-bit-rate  RZ-data  and  the  label  is  attached  in  the  time  domain  to  the 
payload  and  coded  with  non  return  to  zero  (NRZ)  data.  In  this  case  the  payload  will  be 
converted  to  the  new  wavelength  while  the  label  is  not.  A  new  label  can  then  be  inserted 
either  by  premodulating  the  CW  light  into  the  converter  or  simply  by  adding  it  after  the 
wavelength  converter.  This  feature  was  investigated  in  [5]  at  10  Gbit/s  but  is  certainly 
scalable  to  much  higher  bit-rates,  as  shown  by  simulations  at  160  Gbit/s  in  [2]. 


17 


Packet  transmitter 


Figure  2.  Experimental  set-up  of  40  Gbit/s  packet  routing  and  AOLS  experiment. 


To  demonstrate  AOLS  at  high  bit  rates  an  experiment  was  performed  where  packets  of  40 
Gbit/s  data  was  routed  to  different  wavelengths  using  a  wavelength  converter  [6].  The  40 
Gbit/s  payload  was  in  RZ  format  and  an  attached  time  domain  label  was  encoded  using 
NRZ  at  2.5  Gbit/s.  By  using  a  fast  tunable  laser  each  packet  can  be  routed  to  different 
wavelengths.  The  attached  label  is  also  replaced  upon  wavelength  conversion  by  using 
the  technique  described  above.  Figure  2  shows  the  experimental  set-up  used  in  the 
routing  experiment.  Figure  3a  shows  the  incoming  packets  to  the  node  with  the  NRZ 
labels  at  lower  power  than  the  RZ  payload  to  accomplish  equal  energy  per  data  bit  in 
header  and  payload.  The  lower  trace  shows  the  headers  with  the  payload  switched  off. 
Figure  3b  shows  every  other  packet  routed  to  1538  nm  and  1543  nm  with  new  inserted 
labels.  The  eye  patterns  shown  in  Figure  3c  and  d  were  taken  by  zooming  into  the 
payload,  before  (1536  nm)  and  after  wavelength  converter  (1543  nm).  BER 
measurements  were  performed  on  both  label  and  payload  on  incoming  and  outgoing 
packets  and  less  than  4  dB  penalty  at  a  BER=10'9  was  observed  for  all  routed  packets. 
The  power  penalty  at  a  BER=109  for  the  replaced  labels  was  less  than  2.5  dB.  For 
wavelength  routing  at  10  Gbit/s,  the  penalty  was  less  than  1  dB  for  the  payload,  which 
indicates  that  a  major  part  of  the  penalty  is  due  to  instabilities  in  the  40  Gbit/s  data! 
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Figure  3.  Packets  (a)  and  40  Gbit/s  eye-patterns  (c)  before  wavelength  routing.  Packets 
routed  to  1538  nm  and  1543  nm  (b)  and  eye  pattern  of  routed  packet  at  1543  run  (d). 


Figure  4.  80  Gbit/s  eye  pattern  (a)  and  demultiplexed  10  Gbit/s  channels  (b-i). 

The  wavelength  converter  has  also  several  other  applications.  Figure  4  shows  the 
wavelength  converter  acting  as  an  ultra  fast  demultiplexer  where  incoming  80  Gbit/s  data 


Bit  error  rate 


are  demultiplexed  down  to  8  channels  of  10  Gbit/s  data  [7,8].  Another  application  is 
multiplexing,  where  a  number  of  wavelength  channels  can  be  multiplexed  together  into 
one  optical  time  domain  multiplexed  (OTDM)  channel  [9].  Figure  5  shows  the 
experimental  set-up  for  this  experiment  and  figure  6  shows  bit-error  rate  measurements  of 
the  OTDM  multiplexed  data  at  the  output. 


Figure  5.  Experimental  set-up  for  the  multiplexer. 


Average  received  optical  power  [dBm] 

Figure  6.  Bit-error  rate  measurements  of  one  input  10  Gbit/s  channel,  baseline  (B),  and 
the  four  channels  of  the  output  40  Gbit/s  OTDM  data  stream. 
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Polarization  mode  dispersion 


Polarization  mode  dispersion  (PMD)  is  regarded  as  the  most  predominant  obstacle  for 
fiber  optical  transmission  at  ultra-high  data  rates,  i.e.  40  Gbit/s  and  beyond.  The  impact 
from  PMD  varies  with  wavelength  and  over  time,  which  makes  it  difficult  to  predict 
signal  degradation  in  the  system.  In  a  packet  network  environment  it  is  even  more 
difficult  since  incoming  packets  to  a  node  may  have  traveled  different  fibers  and  thus 
experienced  different  PMD.  Traditionally  the  influence  of  PMD  has  been  treated  using 
statistics,  i.e.  the  probability  of  certain  signal  degradation  is  calculated.  However,  in  a 
network  where  decisions  about  possible  signal  degradation  due  to  PMD  have  to  be  made 
in  real  time,  statistics  cannot  be  used.  Instead  we  would  like  to  measure  some  optical 
parameters  and  determine  the  signal  degradation  due  to  PMD  in  real  time.  One  method 
investigated  under  the  MOST  program  is  to  measure  how  the  stokes  vector  of  the  data 
modulated  light  varies  across  the  optical  spectrum.  From  this  data  the  pulse  distortion  due 
to  PMD  can  be  estimated.  Figure  7  shows  real  and  estimated  pulse  broadening  for 
different  PMD  values.  Another  topic  of  interest  is  how  sensitive  different  types  of  optical 
receivers  are  to  PMD.  Figure  8  shows  a  comparison  of  the  Q-penalty  introduced  by  PMD 
when  using  direct  detection  of  40  Gbit/s  data  and  detection  using  a  40  to  10  Gbit/s  optical 
demultiplexer  with  a  subsequent  10  Gbit/s  receiver.  A  seen  from  figure  8,  the 
demultiplexer  receiver  performs  significantly  better  in  the  presence  of  PMD  than  the  non 
demultiplexed  40  Gbit/s  receiver. 
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Figure  7.  Real  and  estimated  pulse  widths  for  different  values  of  PMD. 
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Figure  8.  Q-penalty  versus  PMD  for  direct  detection  of  40  Gbit/s  data  and  40  Gbit/s 
optically  demultiplexed  to  10  Gbit/s  with  subsequent  10  Gbit/s  detection. 
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Ultrafast  Optics  for  Optical  Communications 

Y.  Fainman 

Ultrashort  pulse  laser  technology  is  commercially  available  today,  producing  high  energy 
pulses  of  optical  radiation  in  the  femtosecond  duration  range.  We  are  investigating 
methods  of  harnessing  such  ultrashort  pulses  to  increase  the  speed,  provide  security  and 
reliability  in  transmission,  distribution  and  storage  of  information  in  communication  and 
computing  applications.  To  implement  these  features,  real-time  all-optical  processing 
techniques  for  achieving  information  conversion  between  slower  opto-electronic  devices 
and  the  ultrafast  pulse  phenomena  need  to  be  constructed.  This  processing  is  performed 
primarily  in  the  temporal  frequency  domain  by  spatially  dispersing  the  frequency 
components  in  a  spectral  processing  device  (SPD)  and  performing  operations  on  the 
spectrally  decomposed  wave  (SDW).  To  fulfill  our  goal  of  real-time  SDW  processing,  we 
utilize  nonlinear  wave  mixing  processes  based  on  three-  and  four-  wave  mixing  in  a 
second  order  nonlinear  crystal  inside  the  SPD.  The  latter  is  achieved  via  cascaded 
second-order  nonlinearities  (CSN).  The  CSN  arrangement  consists  of  a  frequency-up 
conversion  process  followed  by  a  frequency-down  conversion  process  satisfying  the  type- 
II  non-collinear  phase  matching  condition. 

Our  experiments  are  concerned  with  ultrafast  information  exchange  between  spatially 
parallel  signals  and  higher  bandwidth  temporal  signals.  For  the  waveform  synthesis 
experiment,  we  introduce  two  spatial  information  modulated  waves  carried  by  quasi- 
monochromatic  light  and  a  SDW  of  a  ultrashort  femtosecond  pulse  (see  Fig.  la).  The  four 
wave  mixing  process  produces  a  SDW  that  is  a  product  of  three  waveforms:  a  spatial 
Fourier  Transform  (FT)  of  the  two  spatial  information  carrying  waves  and  the  SDW  (i.e., 
temporal  FT)  of  a  femtosecond  laser  pulse.  The  spatial-temporal  information  exchange’ 
(i.e.,  the  generated  SDW)  results  in  a  synthesized  waveform  that  is  a  time-scaled  version 
of  the  spatial  image,  performed  on  a  single  shot  basis  with  femtosecond-rate  response 
time  due  to  the  fast  nonlinearity.  We  performed  several  experiments  to  demonstrate 
various  synthesized  waveforms.  A  spatial  sequence  of  point  sources,  generated  by  a 
cylindrical  lenslet  array,  was  converted  to  an  ultrafast  pulse  packet  (see  Fig.  lb). 
Modulating  the  individual  point  sources  in  the  spatial  domain  resulted  in  an  analogously 
modulated  ultrafast  waveform,  with  no  evidence  of  crosstalk  between  the  pulses.  This 
functionality  may  be  desired  for  an  ultrahigh  bandwidth  optical  communication 
application.  When  an  adjustable  slit  was  placed  in  the  spatial  channel,  the  ultrafast 
waveform  resembled  a  square  pulse  (see  Fig.  lc).  The  duration  of  the  square  pulse  was 
adjusted  in  real  time  by  varying  the  width  of  the  slit.  The  synthesized  waveforms  were 
generated  with  a  high  conversion  efficiency  of  16%. 
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The  inverse  time-to-space  transformation  for  detection  of  femtosecond  pulse  sequences  is 
achieved  using  nonlinear  three-wave  mixing  in  a  1(2)  crystal.  The  two  input  waves  are 
the  SDW  of  a  sequence  of  ultrashort  pulses  that  need  to  be  detected  and  a  reference  pulse. 
The  nonlinear  .interaction  between  the  two  SDW’s  results  in  generating 
quasimonochromatic  second  harmonic  wave.  The  frequency  of  the  second  harmonic 
fields  is  twice  the  center  frequency  of  the  incident  fields.  The  generated  second  harmonic 
fields  contain  spatial  frequencies  determined  by  the  time  delay  between  the  reference 
pulse  and  the  pulses  in  the  signal.  Thus  a  1-D  spatial  FT  of  the  second  harmonic  field 
produces  a  1-D  spatial  image  equivalent  to  the  temporal  cross-correlation  between  the 
reference  and  the  signal  pulses.  With  short  pulses,  the  spatial  image  has  one-to-one 
correspondence  with  the  signal  pulse,  implementing  the  desired  time-to-space 
demultiplexing  at  femtosecond  rates.  The  spatial  image,  formed  by  the  generated  second 
harmonic  field,  reveals  spatially  the  information  on  the  temporal  signal  waveform  and 
can  be  detected  using  slower  electronic  detection  devices. 

For  instantaneous  time  domain  processing  of  ultrafast  waveforms,  three  ultrafast 
waveforms  are  introduced  to  the  SPD,  giving  rise  to  a  fourth  SDW  by  the  CSN  wave 
mixing  process.  The  effective  processing  of  the  resultant  ultrafast  waveform  depends  on 
the  configuration  of  the  SPD,  as  it  is  possible  to  control  the  directions  of  the  spatial 
dispersion,  enabling  new  signal  processing  functionality.  We  demonstrated  time  reversal 
of  ultrafast  waveforms  with  our  processor  by  both  phase  conjugation  and  spectral 
information  inversion.  The  input  complex  amplitude  ultrafast  waveform  consisted  of  a 
transform  limited  pulse  followed  by  a  positively  chirped  pulse.  Depending  on  the  time 
reversal  configuration,  the  two  pulses  exchanged  their  location  while  the  positively 
chirped  pulse  remains  indeed  positively  chirped. 

Relative  to  other  spatial-temporal  processing  techniques,  our  nonlinear  wave  mixing 
approaches  to  spatio-temporal  processing  provide  femtosecond  rate  processing  due  to  the 
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fast  bound  electron  nonlinearity  and  high  efficiency  on  account  of  a  relatively  large  |(2) 
coefficient  in  bulk  crystals.  The  spatial-temporal  process  that  we  have  demonstrated 
generate  output  spatial  and  temporal  waveforms  that  can  be  changed  in  real  time.  Since 
the  technique  realizes  a  general  wave  mixing  process  of  temporal  and  spatial  information¬ 
carrying  waves,  the  setup  may  be  converted  to  provide  the  convolution  or  correlation 
signal  between  spatial  and  temporal  channels,  with  the  output  in  either  the  temporal  or 
the  spatial  domain.  Thus,  this  spatial-temporal  process  can  be  considered  a  fundamental 
system  for  performing  ultrafast  signal  processing  on  optical  waveforms  in  the  time  and 
space  domain.  Currently,  we  investigate  techniques  for  characterization  of  the  spatial  and 
temporal  modes  in  the  fiber  optic  channel,  enabling  real  time  monitoring  and  control  of 
future  optical  network  systems.  We  launch  a  signal  wave  into  a  few-mode  optical  fiber, 
where  it  excites  several  spatial  modes,  each  propagating  with  corresponding  group 
velocity.  At  the  output  of  the  fiber  the  signal  wave  is  combined  with  the  reference  wave, 
and  the  total  optical  intensity  is  detected  using  a  CCD  camera.  Using  spatial  Fourier 
domain  filtering  we  decouple  the  interference  portion  of  the  total  observed  intensity  from 
the  background,  and  obtain  background-free  cross-correlation  of  the  reference  and  signal 
waves.  We  perform  this  procedure  for  various  relative  time  delays  between  the  signal  and 
the  reference,  and  obtain  spatio-temporal  impulse  response  of  the  fiber  (Fig.2). 


Fig.  2.  Spatio-temporal  modes  of  the  fiber. 


!'■ ' 
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80  Gb/sec  TDM  based  fiber  ICs  designs  using  InP  DHBT 

technology 

Mark  J.  W.  Rodwell 


Previously,  SiGe  and  InP  technologies  were  providing  an  ft,  fmax  ~  90  GHz  and  ft,  fmax  « 
140  GHz,  respectively.  Once  established  that  data  rates  for  fiber  chipsets  could  be 
developed  at  greater  than  10  Gb/sec  using  either  of  these  technologies,  a  race  began 
amongst  telecommunication  companies  to  be  the  first  to  bring  40  Gb/sec  TDM  based 
chipsets  and  equipment  to  the  market.  In  looking  at  what  will  be  needed  from  a  device 
technology  in  terms  of  speed  and  scaling  in  order  to  take  data  rates  to  80  Gb/sec  and 
beyond,  InP  is  years  ahead  of  SiGe  and  will  be  the  material  system  used  for  first 
generation  results  at  80  Gb/sec. 

Amongst  the  components  that  compose  a  fiber  IC  chipset,  the  electro-optic  (EO) 
modulator  driver  design  provides  a  unique  challenge  in  that  it  is  a  broadband  amplifier 
(DC-70GHz  bandwidth  for  80  Gb/sec)  that  needs  to  produce  5  Volt  drive  into  a  50  ohm 
load— -Imin  =  100  mA.  It  was  our  intent  to  design  such  a  modulator  driver  into  the 
aforementioned  material  systems  with  a  conventional  mesa  structure  for  operation  at  40 
Gb/sec,  and  UCSB’s  transferred  substrate  TS-DHBT  process  for  operation  at  80Gb/sec — 
only  the  latter  of  which  was  completed.  In  addition  to  the  modulator  driver,  we  pursued 
other  circuits  that  comprise  a  fiber  IC  chipset.  Design  and  layout  of  a  2: 1  multiplexer  and 
Cherry-Hooper  based  limiting  amplifier  (LIA)  were  completed  for  operation  at  100 
Gb/sec  in  the  TS-DHBT  process.  These  circuits  have  not  yet  been  fabricated. 

Lastly,  with  continued  processing  difficulty  and  low  circuit  yield  in  the  TS-HBT  process, 
our  device  team  has  developed  a  conventional  narrow  mesa  technology  that  utilizes  the 
epitaxial  engineering  behind  our  transferred  substrate  devices.  In  making  this  technology 
feasible  for  circuits  with  hundreds  of  devices,  we  have  retained  our  low  loss  dielectric, 
microstrip  wiring  environment  to  minimize  parasitic  loading. 


I.  Circuit  design  techniques 


a.  EO  Modulator  Driver 

For  the  existing  EO  modulator  drivers  at  40  Gb/sec  operation  that  are  available 
commercially,  a  drive  voltage  for  medium  halt  fiber  applications  is  about  3.5  Volts  into  a 
50  ohm  load.  If  a  conventional  lumped  design  is  used  and  back  terminated  with  50  ohms, 
we  would  need  to  switch  (3.5  V)/( 25  ohm)  =  140  mA  in  the  amplifier.  We  cannot 
simultaneously  have  devices  large  enough  to  pull  this  current,  dissipate  the  heat 
generated  by  140  mA  in  a  small  area,  and  have  the  bandwidth  we  need  to  satisfy  our 
design  goal — 75  GHz  for  80  Gb/sec. 
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The  solution  to  all  three  of  these  problems  is  to  use  a  distributed,  traveling  wave  TWA 
topology.  By  distributing  the  amplifier  into  smaller  cells  and  connecting  them  to  a 
transmission  lme  bus,  we  can  superimpose  the  current  contribution  of  each  cell  in  order 
to  meet  our  current  needs.  The  parasitic  input  and  output  capacitances  from  the  cells  are 
absorbed  by  the  transmission  line,  which  shifts  the  characteristic  impedance  to: 


Z  TL ,  synthetic  —  . .  . 

V  Ctl  +  Ctwa  , cell 

We  choose  Zjl  such  that  ZTL,symhetic  =  50  Q.  By  having  the  device  capacitances  part  of 
the  transmission  line,  our  frequency  limit  is  the  Bragg  cutoff  frequency  of  the  synthetic 
transmission  line.  This  is  a  design  variable  we  can  manage: 


f Bragg  — 


f Bragg  — 


LtL  ■  [Ctl  jf  Ctwa ,  Cell) 

71  *  Ztl  ,  syrtthetic  *  ( Ctl  +  Ctwa  Xell) 


If  the  Bragg  limit  needs  to  be  increased,  we  can  reduce  the  capacitive  contribution  of 
each  cell.  For  example,  on  the  input  line,  the  diffusion  capacitance  CBE  dominates 
Reducing  the  current  contribution  of  each  cell,  and  then  adding  more  cells  will  maintain 
the  same  output  current  and  increase  fBragg.  At  some  point  though  the  addition  of  more 
smaller  cells  will  not  contribute  to  the  gain.  The  conductive  losses  from  the  TWA  cells 
will  accumulate  because  it  looks  like  the  transmission  line  is  lossy.  Evaluating  the  gain 
bandwidth  product  of  the  TWA  with  conductive  losses  yields  an  interesting  result.  The 
gain  bandwidth  product  of  a  TWA  is  not  Avft — it  becomes  Ayf^ax* 


In  this  design,  the  number  of  cells  (10  cells)  and  device  sizes  were  picked  in  such  a  way 
as  be  able  to  drive  large  currents,  and  have  fBragg  =  75  GHz.  By  setting  the  Bra*g 
frequency  for  only  the  bandwidth  needed,  we  do  not  add  any  unnecessary  loss  by  having 
more  cells  and  this  keeps  the  overall  size  of  the  circuit  manageable,  for  an  already  large 
amplifier.  The  simulated  performance  of  this  amplifier  showed  an  S(2,l)  =15.2  dB,  fBra 
-114  GHz,  with  good  input  and  output  transmission  line  phase  matching — S(l,l)  and 

S(2,2)  <  -10  dB  up  to  70  GHz.  The  large  signal  eye  diagram  simulation  at  80  Gb/sec  is 
shown  m  Fig.  1. 


Fig.  1  Mod  driver  eye  diagram— 80Gb/sec,  3.5V  out 
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b.  2:1  MUX 


The  2:1  multiplexer  employs  traditional  single  level  ECL  gated  logic  with  Master/Slave 
latch  input  data  retiming  and  two  stage  differential  buffering  for  output  data  restoration. 
In  this  2: 1  MUX  design,  the  operating  speed  is  limited  by  the  ability  to  clock  the  retiming 
latches  and  serialize  the  two  data  streams  at  the  selector  circuit.  Both  the  retiming  latches 
and  the  selector  are  clocked  at  a  frequency  equal  to  the  rate  of  the  incoming  data. 

With  this  MUX  design,  the  slowest  part  within  it  is  the  time  of  flight  of  the  signal  through 
the  interconnects  of  the  retiming  latch.  Minimizing  the  delay  associated  with  this  circuit 
and  its  interconnects  is  what  will  maximize  the  operating  speed  of  the  MUX — this  is 
where  we  placed  our  efforts. 

To  do  this  optimization,  we  turned  our  attention  to  the  static  frequency  divider — the 
digital  benchmark  circuit  that  tests  the  limit  of  a  transistor  technology.  This  circuit  is  an 
M/S  latch  whose  output  is  fed-back  to  its  input  to  make  the  output  frequency  half  of  that 
of  the  source  clocking  the  circuit.  Through  careful  design  and  tight  layout,  a  maximum 
clocking  frequency  of  95  GHz  was  simulated.  With  this  design  and  optimization 
complete,  the  footprint  of  the  static  frequency  divider  was  configured  back  as  an  M/S 
latch  and  used  for  die  input  data  retiming  circuits  and  the  selector  in  the  MUX. 


Fig.  2  Schematic  of  Static  Frequency  Divider 

The  layout  of  the  2:1  MUX  contained  196  transistors.  With  layout  parasitics  included,  an 
error  free,  open  output  eye  diagram  of  100  Gb/sec  and  160  Gb/sec  were  simulated.  This 
is  quite  close  to  the  maximum  clock  frequency  of  the  static  frequency  divider,  thus 
validating  the  design  approach  taken.  At  output  data  rates  higher  than  160  Gb/sec,  errors 
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began  occurring  due  to  the  jitter  on  the  data  waveform  at  the  selector  input  from  the 
retiming  circuits. 


Fig.  3  2:1  MUX  eye  diagram — simulated  at  160Gb/s 


c.  4  stage  Cherry-Hooper  based  limiting  amplifier 

The  Cherry-Hooper  amplifier  through  its  use  of  transconductance  to  transimpedance 
transformation,  provides  excellent  gain  and  graceful  output  voltage  limiting  for  small  and 
large  signal  mputs.  A  detailed  analysis  of  the  time  constant  delays  of  the  circuit  shows 
tiiat  the  feedback  resistance  in  the  transimpedance  level  causes  a  very  small  amount  of 
back  loading  on  successive  stages — allowing  us  to  use  many  gain  stages  with  a  small 
penalty  in  circuit  bandwidth.  In  addition,  this  characteristic  makes  the  overall  first  order 
time  constant  of  the  circuit  quite  small.  Unfortunately,  the  feedback  does  little  to  reduce 
the  second  order  time  constant.  The  step  response  of  a  circuit  that  has  this  small  first 

order  /  large  second  order  time  constant  is  one  that  will  be  very  fast,  but  will  ring 
horribly.  & 

The  mam  design  challenge  then  comes  in  trying  to  reduce  the  second  order  term  without 
losmg  speed  due  to  increases  in  the  first  order  term.  By  selective  picking  of  the  device 
geometry  and  careful  distribution  of  the  gain  contributions  of  each  cell,  the  gain  bump 
was  removed  from  frequency  response  and  minimal  ringing  exhibited  in  the  eye  diagram 
Designing  with  our  transferred  substrate  DHBT  devices,  a  four  cell  Cheny-Hooper  LIA 
was  designed.  Simulated  performance  shows  an  S(2,l)  =  43  dB,  -3  dB  bandwidth  =  77 

GHz,  and  an  open  eye  diagram  for  Vm  =  10  and  200  mV  and  Vout  =  250  mV  at  140 
Gb/sec. 
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Fig.  5  LIA  output  eye  diagram — simulated  at 
140  Gb/sec  Yin  =  10  mV,  Vout  =  250  mV 


Fig.  6  LIA  output  eye  diagram — simulated  at 
140  Gb/sec  Vin  =  200  mV,  Vout  =  250  mV 
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H.  UCSB’s  InP  Mesa  DHBT  process 


We  began  the  development  of  a  traditional  mesa  HBT  that  could  exploit  the  epitaxial 
engineering  behind  our  TS-HBT  devices,  have  narrow  features  to  minimize  Ceb  parasitic 
loading,  and  reduce  process  complexity.  With  lithographic  tools  that  can  reliably 
produce  submicron  features  at  UCSB,  our  device  team  produced  first  generation  mesa 
DHBTs  with  0.5  um  wide  emitters  We,  and  0.5  um  wide  self-aligned  base  contacts  Wr 
(Fig.  7).  The  f,  and  fmax  were  >140  GHz. 


Fig.  7  Physical  profile  of  Narrow  Mesa  DHBT 


The  initial  device  results  from  this  process  were  very  encouraging  and  efforts  to  make 
this  technology  support  passive  components,  multi-level  interconnects,  and  circuits  were 
then  pursued.  Resistors  are  generated  by  evaporating  NiCr  of  thickness  to  produce  an 
Rsheet  =  40  QJ  ,  and  capacitors  are  realized  by  depositing  SiN  between  Ml  and  M2.  For 
the  wiring  environment,  we  spin  over  the  top  of  the  sample  BCB  (benzocyclobutene)— a 
low-loss,  low  8r  dielectric  that  can  be  patterned  for  the  microstrip  wiring  environment. 
Lastly,  using  a  patterned  top  surface  ground  plane,  we  electroplate  gold  onto  the  BCB  to 
complete  the  process.  In  special  instances  when  we  need  to  have  transmission  lines  carry 
large  currents,  we  can  pattern  the  ground  plane  to  support  microstrip  and  coplanar 
waveguide. 


Our  first  process  run  with  the  adopted  circuit  processing  steps  were  very  successful.  The 
epitaxial  layer  structure  was  designed  with  an  InP  emitter,  InGaAs  base  (400  A,  2el9  cm‘ 
Be  doped),  and  InP  collector  (2000  A).  All  of  the  devices  on  the  sample  had  0.5  um 
wide  emitters  and  lengths  of  4,  6,  and  10  um.  The  yield  on  the  devices  that  had  a  base 

contact  width  >  0.5  um  was  very  good.  Amongst  the  devices  that  had  a  0.3  um  base 
contact  width,  the  yield  was  less  than  50% 

The  maximum  current  density  that  could  be  supported  before  the  device  Kirk  limit  at  Vce 
—  0.7  V  was  Jkirk  =  2.25  mA/um2.  The  Rf  performance  of  the  better  devices  on  the  sample 
demonstrated  a  maximum  ft  =  205  GHz,  fmax  =  180  GHz,  at  a  Vce  =  1.5  V  and  J  =  3 
mA/um  .  In  addition  to  devices,  this  maskset  also  had  static  frequency  dividers  (36 
devices)  and  two  stage  Cherry-Hooper  amplifiers  (18  devices)  on  it  with  varying  base 
contact  widths  to  determine  how  narrow  a  contact  we  can  reliably  yield.  Unfortunately,  a 
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layout  error  was  discovered  on  the  static  frequency  dividers — they  did  not  work. 
Amongst  the  Cherry-Hooper  amplifiers,  the  yield  was  low  because  of  a  processing  error 
during  the  electroplating  step.  For  those  that  did  work,  the  Rf  performance  was  in  good 
agreement  with  the  simulated  circuit.  This  circuit  had  an  S(2,l)  =  7  dB  with  a  -3dB 
frequency  at  35  GHz. 

Lastly,  test  structures  were  fabricated  for  further  characterization  of  the  device  yield  with 
this  process.  Series  and  parallel  diode  test  structures,  and  arrays  of  base-emitter  and 
collector-base  diodes  (100  diodes  in  parallel)  had  good  yield,  the  only  failure  mechanism 
showing  an  open  circuit — a  consequence  of  the  same  electroplating  error.  In  addition  to 
this  success,  our  device  team  has  recently  yielded  mesa  DHBTs  with  a  carbon  doped, 
300A,  graded  base  doping  profile,  and  1500  A  collector,  that  has  generated  an  ft  «  280 
GHz  and  fmax  >  400  GHz  at  bias  conditions  of  Vce  =  2.0  V  and  current  density  of  10 
mA/um2.  This  level  of  device  performance  makes  the  pursuit  of  fiber  ICs  at  data  rates  of 
160  Gb/sec  a  realizable  quest. 


HI.  Conclusion 


Our  efforts  to  design  fiber  ICs  into  UCSB’s  transferred  substrate  DHBT  process  showed 
simulated  results  that  validated  the  pursuit  of  next  generation  fiber  ICs  at  data  rates  of  80 
Gb/sec  and  160  Gb/sec.  To  decrease  process  difficultly  and  increase  the  yield  of  circuits 
with  a  device  count  in  the  hundreds,  we  have  aggressively  pursued  a  narrow  mesa  DHBT 
technology  with  speed  comparable  to  our  transferred  substrate  devices.  Continued 
improvements  in  the  epitaxial  layer  structure  and  scaling  will  then  give  us  a  traditional 
device  with  record  speed,  which  is  what  is  needed  to  make  the  circuits  at  160  Gb/sec 
realizable. 
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Design  and  Evaluation  of  the  Fibonacci  Optical  ATM  Switch 

P.M.  Melliar-Smith,  and  Louise  E.  Moser 


Electronic  ATM  switches  are  the  bottleneck  of  an  ATM  network:  ATM  cells  are  not 
switched  fast  enough  to  match  the  transmission  speed  of  the  optical  channels.  It  is 
expected  that  the  gap  between  die  electronic  switching  and  optical  transmission  speeds 
will  continue  to  increase  as  optical  transmission  technology  matures.  A  solution  to  this 
problem  !S  to  implement  ATM  switches  using  optical  technology.  The  major  functions  of 
an  ATM  switch  are  to  route  incoming  cells  to  the  appropriate  output  channels  and  to 
provide  buffering  for  them,  if  necessary.  The  construction  of  a  single  monolithic  fabric  to 
interconnect  large  numbers  of  input  and  output  channels  may  not  be  feasible  due  to 
physica1  and  cost  contraints.  Consequently,  the  switching  fabric  of  a  high-performance 
ATM  switch  typically  consists  of  a  network  of  smaller  switching  elements,  as  shown  in 

ig.  1,  with  each  small  switching  element  having  its  own  buffers,  optical  crossbar  switch 
and  control  unit,  as  shown  in  Fig.  2. 


Fig.  1:  Fig.  1.  A  4-input  and  4-output  switch  fabric  with  two  stages  of  2  x  2 
switching  elements  and  an  8-input  and  8-output  switch  fabric  with  three 
stages  of  2  x  2  switching  elements. 

Such  an  organization  lends  itself  naturally  to  parallel  processing  of  cells  and  parallel 
routmg  decisions.  In  optical  switches,  static  buffering  is  not  feasible  and  buffers  are 
constructed  by  switching  ATM  cells  into  optical  fibers  that  act  as  delay  lines.  The  longer 
the  fiber,  the  longer  the  delay  and  the  greater  the  number  of  cells  buffered  in  the  fiber  By 
switching  an  ATM  cell  through  a  sequence  of  fiber  delay  lines,  any  required  delay  can  be 
achieved.  The  difficulty  of  building  an  optical  crossbar  switch  increases  rapidly  with  the 
number  of  fibers.  Consequently,  it  is  important  to  minimize  the  number  of  fiber  delay 
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lines  while  still  achieving  efficient  operation  of  the  switch.  Although  packet  switch 
architectures  using  optical  delay  fibers  as  buffers  have  been  proposed  in  the  literature,  as 
far  as  we  can  determine,  studies  on  how  to  design  an  optical  switch  that  addresses  ATM- 
specific  requirements  while  requiring  few  fibers  and  low  processing  complexity  do  not 
exist. 


Fig.  2:  A  2  x  2  optical  switching  element. 

In  this  project,  we  developed  the  design  of  a  novel  optical  ATM  switch  architecture  [1]. 
Because  of  the  remarkable  properties  of  the  Fibonacci  numbers,  the  architecture  is 
computationally  efficient,  requires  few  fibers,  and  yields  low  latency  and  cell  loss  rate 
across  diverse  time-correlated  and  heavily  loaded  traffic  profiles.  The  buffer  sets 
corresponding  to  an  empirically  lossless  2x2  ATM  switch  architecture  under  diverse 
traffic  scenarios  have  been  presented.  The  architecture  exhibits  the  desirable 
characteristics  that  the  cell  loss  rate  decreases  exponentially  as  the  number  of  fibers 
increases,  that  the  probability  mass  function  for  the  sample  latency  decays  rapidly,  and 
that  an  exponential  increase  in  the  time  correlation  of  traffic  can  be  offset  by  a  small 
increase  in  the  number  of  fibers.  Because  of  these  salient  characteristics,  this  architecture 
may  play  an  integral  role  in  future  optical  ATM  networks. 

[1]  Allen  Shum,  P.M.  Melliar-Smith,  and  Louise  E.  Moser,  Design  and  Evaluation  of  the 
Fibonacci  Optical  ATM  Switch,  IEEE  TRANSACTIONS  ON  COMPUTERS,  VOL.  50, 
P.  466,  MAY  2001 
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IV.  MUM  Success  Stories 


Optical  Switching 

Calient  Networks 
Goleta,  CA 

John  Bowers,  Dan  Blumenthal,  Founders 


The  central  focus  of  MOST  was  the  development  of  optical  switches.  These  switches 
utilized  InP  technology  to  make  4x4  nonblocking  switch  arrays.  John  Bowers  and  Dan 
Blumenthal  started  Calient  Networks  to  commercialize  optical  switches  for  the 
telecommunications,  test  and  military  markets  and  hired  a  number  of  MOST  graduates  to 
develop  optical  switch  technology  (Olivier  Jerphagnon,  Jonathan  Lang,  Bin  Liu,  Adrian 
Keating). 

A  typical  node  in  an  fiber  optic  network  typically  has  tens  of  fibers,  and  each  fiber 
typically  has  tens  of  wavelengths.  Hence  the  need  is  for  nonblocking  switches  with  large 
port  counts,  typically  100-1000.  The  2D  technology  used  in  all  N  port  crossbar  optical 
switches  up  to  the  founding  of  Calient  requires  N2  2x2  switches  (Fig.  1).  This  is  true  for 
LiNb03,  InGaAsP,  GaAs,  and  thermo-optic  switches.  This  becomes  prohibitively 
complicated  and  expensive  for  any  switch  with  more  than  32  ports.  Calient  developed  a 
3D  technology  (Fig.  2)  that  utilizes  MEMS  technology  to  solve  this  problem.  In  this 
case,  only  2N  switches  are  required.  Hence  a  256  port  switch  only  requires  512  mirrors 
rather  than  the  65,536  switches  required  using  2D  technology.  This  switch  can  switch 
data  at  10  Gbit/s  (Fig.  3),  at  40  Gbit/s  (Fig.  4),  and  should  scale  to  160  Gbit/s.  This 
switch  has  been  used  to  switch  an  aggregate  of  10  Tbit/s  without  degradation.  This 
technology  promises  to  revolutionize  the  way  wavelength  division  multiplexed  optical 
networks  are  designed,  deployed  and  the  way  bandwidth  is  managed  within  the  core  and 
metropolitan  networks.  A  complete  system  containing  redundant  switches,  network 
control  software,  input  and  output  monitoring  is  shown  in  Fig.  5.  Calient  currently 
employs  95  people,  and  has  sold  systems  to  Japan  Telecom,  KDDI,  Juniper  Networks  and 
Oki  Networks. 


39 


Number  of  Switches 


BER 


Figure  2  Schematic  diagram  of  a  3D  MEMS  switch.  The  input  and  output  fiber 
collimators  are  shown  along  with  the  two  MEMS  arrays. 
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Figure  3  Performance  of  the  switch  at  40  Gbit/s. 
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Figure  4  Comparison  of  bit  error  rate  in  a  40  Gbit/s  link  with  and  without  a  256  port 
3D  MEMS  switch  inserted. 


Figure  5  Diamondwave  256:  A  256  port  photonic 


Tunable  Lasers 


Agility  Communications 
Goleta,  CA 

Larry  Coldren,  Greg  Fish,  Founders 


Researchers  at  the  University  of  California,  Santa  Barbara  have  investigated  the 
design,  process  and  integration  of  semiconductor  wavelength  tunable  lasers  and  their 
application  to  telecommunications  and  phased  array  radar  systems.  These  lasers  provide 
the  capability  to  tune  very  narrow  line-width  emission  over  a  large  range  of  output 
wavelengths  across  THz  of  bandwidth.  These  advantages  will  have  impact  across  a  broad 
array  of  applications  including  sensors  and  bio-monitoring,  phased  array  radar  and 
communications  networks.  Military  platforms  that  are  sensitive  to  communications 
equipment  weight,  size  and  power  dissipation  will  benefit  from  this  technology. 
Applications  to  advance  Internet  optical  communications  in  both  the  military  and 
commercial  sectors  are  foreseen. 

Significant  progress  was  made  on  high  levels  of  integration  based  on  indium 
phosphide  integrated  optic  waveguide  technology.  Integration  of  tunable  lasers  with 
other  critical  functions  like  modulation,  wavelength  monitoring,  optical  amplification  and 
all-optical  wavelength  conversion  were  demonstrated  during  the  course  of  this  work. 
Very  high-speed  wavelength  switching  over  more  than  50  wavelengths  shows  promise  to 
adapt  this  technology  to  optical  packet  switched  applications. 

The  basic  -  research  results  of  this  wavelength  tuning  technology  has  been 
commercialized  at  Agility  Communications,  a  UCSB  spin-out  company  that  employs 
over  200  people  in  the  States  of  California  and  Pennsylvania.  A  picture  of  the  Sampled 
Grating  Distributed  Bragg  Reflector  (SGDBR)  wavelength  tunable  structure,  example 
tunable  wavelength  output  and  the  Agility  tunable  laser  product  are  shown  in  Figure  1. 
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Figure  1.  Phased  Array  Radar  applications  that  can  leverage  advances  in  tunable  laser 
integration  technology. 
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Wafer  Fusion:  Materials  Issues  and  Device  Results 

A.  Black,  A.  R.  Hawkins,  N.  M.  Margalit,  D.  I.  Babic,  A.  L.  Holmes,  Jr., 

Y.-L.  Chang,  P.  Abraham,  J.  E.  Bowers,  and  E.  L.  Hu 


Abstract — A  large  number  of  novel  devices  have  been  recently 
demonstrated  using  wafer  fusion  to  integrate  materials  with 
different  lattice  constants.  In  many  cases,  devices  created  using 
this  technique  have  shown  dramatic  improvements  over  those 
which  maintain  a  single  lattice  constant.  We  present  device  results 
and  characterizations  of  the  fused  interface  between  several 
groups  of  materials. 

Index  Terms — Detectors,  optical  fiber  communication,  semicon¬ 
ductor  device  bonding,  semiconductor  device  fabrication,  semi¬ 
conductor  heterojunctions,  semiconductor  lasers. 

L  Introduction 

N  EACH  material  system,  nature  has  dictated  a  set  of 
physical  properties  (i.e.,  mobility,  optical  absorption,  ther¬ 
mal  conductivity,  and  resistivity).  For  a  given  application,  the 
optimal  properties  may  not  reside  in  a  single  material,  but  in 
a  variety  of  disparate  materials.  To  this  end,  high  performance 
semiconductor-based  electronic  and  optoelectronic  device  de¬ 
sign  can  be  greatly  enhanced  if  we  can  freely  integrate 
heterogeneous  materials.  An  effective  integration  method  has 
remained  a  challenge  due  to  lattice  constant  mismatch  between 
semiconductor  systems.  “Monolithic  techniques”  such  as  the 
epitaxial  or  pseudomorphic  growth  of  mismatched  materials 
can  result  in  highly  defective  layers  which  degrade  or  inhibit 
device  operation. 

Recently,  there  has  been  exciting  progress  in  the  solution 
to  the  problem  of  matching  disparate  materials  to  optimize 
device  performance:  a  process  termed  “wafer  fusion”.  Fusion 
bonding  is  a  special  case  of  direct  bonding  in  which  chemical 
bonds  are  established  directly  between  two  materials  at  their 
hetero-interface  in  the  absence  of  an  intermediate  layer.  For 
fusion  bonding  to  occur  in  semiconductors,  two  oxide-free 
(and  contaminant-free)  crystals  are  directly  bonded  and  then 
annealed  to  yield  a  crystalline  junction. 

Fig.  1  shows  the  range  of  wafer  applications  that  have 
been  currently  addressed.  In  many  cases,  record  performances 
have  been  achieved  as  a  result  of  using  fusion  to  eliminate 
the  limitations  imposed  by  lattice  matched  epitaxy.  Moving 
clockwise  around  the  chart  starting  in  the  upper-left  comer, 
bonding  GaAs-Al As  mirrors  to  InGaAsP  active  regions  solves 
the  fundamental  limits  in  InP-InGaAsP  mirror  reflectivities  in 
vertical  cavity  lasers  [1].  Deeper  quantum  wells  with  strong 
optical  confinement  can  be  achieved  by  using  AlGaAs  as  the 
outer  confining  layer  for  in-plane  InGaAsP  lasers.  The  problem 
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Fig.  1.  Applications  of  fusing  and  bonding  processes  in  optoelectronics  and 
electronics. 


of  cleaving  perpendicular  facets  in  hexagonal  crystals  can  be 
solved  by  fusing  to  a  cubic  crystal  and  removing  most  of 
the  hexagonal  crystal  [2].  Hewlett  Packard  has  used  fusing 
in  light-emitting  diodes  (LED’s)  to  replace  an  absorptive 
substrate  with  a  transparent  substrate,  and  also  to  reduce 
the  substrate  thermal  resistance  [3].  Bellcore  has  used  fusing 
to  increase  the  four-wave  mixing  power  in  waveguides  for 
wavelength-division  multiplexing  (WDM)  applications  [4]. 
Oki  and  other  companies  have  wafer  fused  III-V  sources 
and  detectors  to  Si  with  the  goal  of  optoelectronic-integrated 
circuits  (OEIC)  [5].  Several  groups  have  used  GaAs-AlAs 
mirrors  to  get  sharper  bandpass  features  in  WDM  detectors 
[6],  [7].  Finally,  improved  silicon  telecommunication  APD’s 
have  been  demonstrated  by  combining  the  ideal  avalanche 
characteristics  of  silicon  with  the  absorption  characteristics  of 
InGaAs  to  get  higher  gain  bandwidth  products,  lower  noise 
and  reduced  temperature  sensitivity  [8]. 

This  paper  will  begin  with  a  brief  description  of  the  fusion 
process  in  Section  II.  Section  III  will  be  a  description  of 
various  device  applications  utilizing  wafer  fusion.  The  device 
results  have  thus  far  been  dramatically  successful,  but  point  out 
critical  issues  that  need  to  be  explored.  Section  IV  discusses 
some  of  these  issues,  as  well  as  the  electrical  and  optical 
characterization  of  wafer  fused  interfaces. 

II.  Wafer  Fusing:  Process  Description 

The  technique  of  fusion  bonding  consists  of  three  steps: 
surface  preparation,  placement  of  the  substrates  into  contact 
with  each  other,  and  an  elevated  temperature  anneal  while 
the  substrates  are  maintained  in  contact  under  pressure.  The 
surface  preparation  steps  include  removal  of  oxides  and  or¬ 
ganic  contaminants,  as  well  as  chemical  activation  of  the 
surface  for  interfacial  bonding.  Bonding  by  Van  der  Waals 
forces  occurs  when  two  such  clean  and  smooth  macroscopic 
bodies  are  placed  in  contact,  while  chemical  reactions  and 
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re-crystallization  takes  place  during  the  elevated  temperature 
anneal  [9]. 

There  are  a  number  of  techniques  used  for  surface  prepa¬ 
ration  and  placing  substrates  in  intimate  contact.  Bonding 
may  be  classified  as  “wet”  or  “dry.”  In  wet  bonding,  the 
samples  are  put  together  and  pressure  is  applied  on  them 
while  the  samples  are  still  immersed  in  an  oxide  removing  or 
passivating  chemical.  In  the  dry-bonding  technique,  the  oxides 
are  removed  using  a  suitable  chemical,  and  the  samples  are 
placed  in  contact  after  the  surfaces  have  been  dried.  Surface 
oxidation  must  be  prevented  when  using  the  latter  technique.  In 
Si  bonding,  hydrofluoric  acid  is  often  used  in  this  application 
since  it  leaves  a  hydrogen-terminated  (hydrophobic)  silicon 
surface  [10];  HF  is  also  known  to  leave  an  oxide-free  InP 
surface  [11].  Surface  preparation  may  also  be  carried  out  in 
an  environment  with  reduced  oxygen  partial  pressure.  In  both 
wet  and  dry  bonding,  it  is  necessary  to  allow  the  chemicals 
adsorbed  at  the  surface  to  escape  before  the  bonding  occurs. 
One  way  of  achieving  this  is  to  etch  “escape  channels”  into 
the  substrate.  This  technique  has  been  successfully  used  in  a 
number  of  InP-GaAs  fusion  experiments  [12],  and  in  silicon 
direct  bonding  [13].  If  the  surface  is  not  patterned,  the  trapped 
liquids  and  gases  produce  large  scale  bubbles  (size  MOO  ^m) 
[14],  and  a  large  density  of  microscopic  voids  or  oxide  islands 
(size  M  /im).  After  channel  etching  and  cleaning,  the  surface 
preparation  typically  consists  of  sequential  oxidation  and  oxide 
removal  steps.  The  surface  may  be  oxidized  by  oxygen  plasma, 
UV-ozone  oxidation  [15],  or  wet  chemical  oxidation  (hydrogen 
peroxide),  while  the  oxide  removal  may  be  achieved  using 
various  acids  and  bases  [16]— [18].  After  the  samples  are  placed 
into  contact,  they  are  subjected  to  an  annealing  step  at  an 
elevated  temperature  so  that  a  crystalline  junction  is  formed. 
The  process  of  fusion  bonding  of  GaAs  and  InP  (or  other 
Ga-In-As-P  materials)  is  not  fully  understood,  but  is  believed 
to  rely  on  the  proximity  annealing  [19]  of  InP  with  a  GaAs 
cap.  As  phosphorus  desorbs  from  the  InP  surface  and  fills 
the  gaps  in  the  fused  junction,  indium  diffuses  laterally  and 
fills  the  voids,  ultimately  reacting  with  the  phosphorus  to  form 
an  InGaAsP  alloy  [20],  [21].  Phosphorus  and  indium  are  the 
primary  mobile  species  during  fusion  bonding  of  GaAs  and 
InP  [12].  The  pressure  maintained  on  the  two  wafers  to  be 
bonded  helps  to  even  out  the  surface  nonuniformities. 

The  fusion  anneal  of  GaAs-InP  junctions  is  performed 
under  pressure  and  elevated  temperature  in  specially  designed 
fixtures.  The  sample  sizes  used  in  experiments  vary  from 
less  than  1  cm2  to  full  2"  wafers  [22],  [23].  The  fusion 
pressure  reported  in  the  literature  varies  over  a  wide  range, 
from  relatively  small  pressures  of  3  kPa  [17],  [18],  30  kPa 
[24],  100  kPa  [25],  to  -3  MPa  [12],  [20],  [22],  [26].  The  key 
factor  in  the  effective  application  of  the  force  on  the  samples 
is  its  uniformity.  The  fusion  temperatures  for  GaAs-InP  fusion 
range  between  550  °C-650  °C,  with  peak  temperatures  usually 
held  for  less  than  30  min.  Higher  pressures  should  result  in 
the  ability  to  carry  out  lower  temperature  fusing. 

The  most  common  substrate  orientation  used  for  fusion 
bonding  is  (001),  but  many  other  combinations  have  been 
explored  [17].  In  particular,  the  smooth  surface  moiphology  of 
epitaxially  grown  layers  is  critical  for  successful  fusing.  This 
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Fig.  2.  X-ray  diffraction  spectra  of  two  samples  consisting  of  three  compres- 
sively  strained  InGaAsP  QW’s  with  unstrained  InGaAsP  barriers.  One  sample 
is  grown  on  a  2° -off  (100)  InP  substrate  whereas  the  other  one  is  grown  on 
a  0.2° -off  substrate. 

may  in  turn  dictate  growth  on  substrates  slightly  misoriented 
from  (001).  We  have  found  this  to  be  true  in  MOCVD  growth 
of  InP-based  materials  for  fusing.  The  MOCVD  growths 
employ  trimethylindium  (TMI),  trimethylgallium  (TMG),  ter- 
tiarybutylarsine  (TBA)  and  tertiarybutylphosphine  (TBP),  us¬ 
ing  growth  temperatures  ranging  from  610  °C-650  °C,  at 
either  atmospheric  pressure  or  350  toix.  More  details  about 
the  growth  conditions  can  be  found  in  reference  [19].  Tear¬ 
drop-like  hillock  defects*  can  result  in  the  grown  material, 
resulting  from  preferential  nucleation  on  the  screw  dislocations 
emerging  at  the  surface  of  the  crystal.  A  misorientation  of 
the  crystal  surface  with  respect  to  the  exact  (100)  orientation 
creates  monoatomic  steps  that  are  also  preferential  adsorption 
sites,  making  it  possible  for  the  growth  to  proceed  without 
the  formation  of  hillock  defects.  However,  too  high  a  misori¬ 
entation  can  give  rise  to  step  bunching,  also  creating  surface 
roughness.  The  optimum  misorientation  will  produce  smooth 
surface  morphology;  this  depends  on  growth  parameters  that 
influence  the  diffusion  length  of  the  atomic  species  at  the 
surface:  temperature,  growth  rate  and  V-III  ratio  [27].  We 
chose  a  misorientation  of  0.2°.  Fig.  2  shows  the  X-ray  diffrac¬ 
tion  spectra  of  two  samples  consisting  of  three  compressively 
strained  InGaAsP  QW’s  with  unstrained  InGaAsP  barriers. 
One  sample  is  grown  on  a  2°  off  substrate  whereas  the  other 
one  is  grown  on  a  0.2°  off  substrate.  The  comparison  of 
the  two  spectra  clearly  show  that  the  crystallographic  quality 
of  the  2°-off  sample  is  not  as  good  as  that  of  the  0.2° 
sample.  Moreover,  the  full-width  at  half-maximum  (FWHM) 
of  the  photoluminescence  (PL)  peak  of  the  2°-off  sample  is 
larger  (about  double)  than  that  of  the  0.2°  off  sample.  This 
results  from  the  rougher  surface  of  the  former  sample,  with 
attendant  thickness  variations  that  increase  the  widths  of  the 
luminescence  peaks. 

The  number  of  dangling  bonds  and  quality  of  the  fused 
interface  directly  influences  the  electrical  characteristics  [21]. 
In  order  to  achieve  a  near-perfect  crystal  continuation,  the 


HAWKINS  et  al:  WAFER  FUSION:  MATERIALS  ISSUES  AND  DEVICE  RESULTS 


945 


two  substrates  should  be  enantiomorphic.1  Since  zinc-blend 
crystals  have  S4  point-group  symmetry,  the  two  substrates  (A 
and  B)  to  be  fused  must  be  aligned  in  such  a  way  that  cleaved 
(Oil)  planes  of  substrate  A  are  perpendicular  to  cleaved  (Oil) 
planes  of  substrate  B.  This  substrate  orientation  is  called  “in- 
phase”  bonding  [18].  If  the  two  (001)  cleavage  planes  of  the 
two  substrates  are  kept  parallel  in  the  alignment,  then  we  have 
“antiphase”  bonding  [18].  Recently,  a  form  of  a  compliant 
substrate  has  been  investigated  using  bonding  of  twisted 
thin  GaAs  epitaxial  layers  [17].  This  compliant  substrate 
promises  to  transcend  the  lattice  matching  requirement  and 
enables  growing  epitaxial  layers  on  top  of  nonlattice  matched 
substrates. 

in.  Device  Realizations  of  Wafer  Fusion 

A.  Vertical-Cavity  Lasers  and  Resonant  Cavity  Photodetectors 

Wafer  fusion  has  had  an  important  impact  in  developing 
long  wavelength  surface  normal  devices,  such  as  vertical- 
cavity  lasers  (VCL’s)  and  resonant  cavity  photodetectors. 
VCL’s  are  potentially  important  sources  for  fiber  optic  com¬ 
munications.  They  offer  high  modulation  bandwidths  at  low- 
bias  currents  as  well  as  ease?  in  testing  and  packaging.  Recent 
work  in  nonwafer-fused  GaAs-based  VCL’s  has  shown  record 
power  conversion  efficiency  as  well  as  record  low-threshold 
currents  (57%  and  40  pA,  respectively)  [28],  [29].  Lasers 
which  emit  at  wavelengths  between  850-980  nm  do  not 
match  up  well  with  the  minima  in  dispersion  and  loss  of 
standard  optical  fibers.  To  effectively  use  VCL’s  for  fiber¬ 
optic  communications,  sources  with  wavelengths  near  1.3 
and  1.5  pm  must  be  fabricated.  This  was  initially  found 
to  be  a  difficult  task  since  the  lattice-matched  InP-InGaAsP 
mirror  materials  had  reflectivities  and  thermal  conductivities 
too  poor  to  achieve  room  temperature  CW  operation.  Wafer 
fusion  allows  the  integration  of  InP-based  active  regions  with 
high  reflectivity  GaAs-AlAs  based  mirrors  to  achieve  high 
performance  operation. 

Fig.  3  shows  the  theoretical  reflectivity  as  a  function  of  the 
number  of  mirror  periods,  for  two  types  of  epitaxial  DBR 
mirrors  available  for  long  wavelength  devices.  The  maximum 
reflectivity  is  limited  by  absorptive  losses  in  the  mirrors. 
The  InP-InGaAsP-based  mirrors,  with  lower  differences  in 
the  indices  of  refraction,  have  a  larger  optical  penetration 
depth  and  thus  lower  maximum  reflectivity.  High-performance 
operation  of  a  VCL  requires  mirror  reflectivities  well  above 
99%.  High-mirror  reflectivities  also  reduce  the  necessary  gain 
for  lasing,  which  in  turn  permits  higher  temperature  operation. 

The  first  room  temperature  CW  operation  of  any  long- 
wavelength  VCL  over  980  nm,  was  demonstrated  by  Babic 
et  ai,  using  two  wafer  fused  GaAs-Al(Ga)As  mirrors  along 
with  an  InP-InGaAsP  1.55  pm  based  active  region  [30].  The 
threshold  and  quantum  efficiency  of  that  laser  was  limited 
by  optical  scattering  from  the  mesa  sidewalls.  A  significant 
improvement  in  all  characteristics  can  be  achieved  with  the 
addition  of  an  AlAs  oxidation  layer,  which  can  be  used  to 

1  Enantiomorph:  either  of  a  pair  of  chemical  compounds  or  crystal  whose 
molecular  structures  have  mirror  image  relationship  to  each  other. 


Number  of  Periods 

Fig.  3.  Comparison  of  the  reflectivity  that  can  be  achieved  with  different 
quarter-wave  minor  stacks. 


AuTi  p-contact 


Fig.  4.  Schematic  structure  of  double-fused  laterally  oxidized  1.55-//m  VCL 
operating  to  64°  C  CW. 


form  a  dielectric  aperture  to  confine  the  current,  and  provide 
improved  confinement  of  the  optical  mode,  away  from  the 
rough-etched  sidewalls  of  the  VCL  pillars  [1].  This  structure 
is  shown  in  Fig.  4. 

The  addition  of  a  dielectric  aperture  allowed  CW  operation 
up  to  64  °C  and  pulsed  operation  up  to  100  °C.  Further  device 
improvements  should  soon  allow  for  CW  operation  up  to  the 
85  °C  necessary  for  commercial  applications.  These  wafer- 
fused  1.55-^m  VCL’s  have  recently  been  used  in  transmission 
experiments,  with  data  rates  as  high  as  2.5  Gb/s  over  200  km 
of  optical  fiber  [31]. 

Any  surface  normal  device  that  requires  a  high  reflectiv¬ 
ity  DBR  mirror  along  with  an  InP  active  region  can  take 
advantage  of  the  wafer  fusion  of  GaAs  to  InP.  Resonant 
cavity  photodetectors  have  a  narrow  absorption  bandwidth 
which  is  determined  by  the  finesse  of  the  cavity  in  which  the 
absorption  layer  is  placed.  High-reflectivity  mirrors  are  needed 
to  achieve  a  narrow  absorption  line.  Such  devices  have  been 
fabricated,  yielding  a  record  94%  quantum  efficiency  with  a 
14-nm  absorption  bandwidth  [7]. 

B .  Long-Wavelength,  In-Plane  Lasers 

Wafer  fusion  has  clearly  been  a  critical  factor  in  the  realiza¬ 
tion  of  long-wavelength  surface  emitting  lasers.  The  benefits  of 
this  process  also  extend  to  in-plane  lasers  operating  at  1.55  pm. 
Long  wavelength  in-plane  lasers  behave  quite  differently  from 
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Fig.  5.  Proposed  structure  for  improved  performance  of  1.55-/un  in-plane 
lasers. 

their  980-nm  InGaAs-AlGaAs  counterparts.  The  characteristic 
temperature,  or  T0  of  980-nm  lasers  is  >200  K  [32],  [33] 
while  the  Tc  values  for  1.55  /im  lasers  are  in  the  range  of 
40-70  K  [34],  [35],  In  addition,  the  1.55-/jm  lasers  cannot 
be  modulated  as  quickly  nor  work  at  as  high  an  operating 
temperature  as  the  980-nm  lasers  [32],  [36]— [38].  In  the  last 
decade  much  effort  has  been  expended  to  better  understand 
this  discrepancy.  Historically,  Auger  nonradiative  recombina¬ 
tion  has  been  designated  as  the  principal  cause  of  the  poor 
temperature  performance  [39],  but  recent  work  highlights  the 
role  of  poor  electron  confinement  in  the  GalnAsP-InP  material 
system  [40]— [42],  We  have  earned  out  detailed  theoretical 
modeling  to  analyze  the  relative  magnitude  of  these  effects. 
Our  results  suggest  that,  while  Auger  effects  do  indeed  limit 
device  performance,  a  two-fold  increase  in  Tc  can  be  achieved 
through  a  reduction  in  the  amount  of  carrier  overflow  [43]. 
This  would  also  lead  to  an  increase  in  the  maximum  operating 
temperature  and  mitigate  the  decrease  in  the  slope  efficiency 
of  the  laser. 

Working  with  lattice-matched  (to  InP)  1.55-^m  materials 
alone,  the  enhanced  electron  confinement  can  only  be  achieved 
with  an  ensuing  severe  reduction  in  the  optical  overlap  of  the 
lasing  mode.  Wafer  fusion  allows  us  to  engineer  increased 
electron  confinement,  without  compromising  optical  confine¬ 
ment.  We  propose  the  improved  in-plane  laser  structure  shown 
in  Fig.  5,  where  Alo.7Gao.3As  forms  the  cladding  layer,  rather 
than  the  more  conventional,  lattice-matched  InP  or  InAlAs 
material.  With  the  lower  index  of  refraction  of  the  AlGaAs 
[44],  the  optical  confinement  of  the  laser  can  be  kept  high 
while  the  electron  confinement  is  also  increased.  This  is  shown 
in  Fig.  6  for  a  three  quantum-well  active  region. 

The  theoretical  improvement  in  high  temperature  perfor¬ 
mance  is  significant,  as  shown  in  Fig.  7.  Here,  we  calculate 
the  characteristic  temperature  as  a  function  of  the  Auger 
coefficient.  The  elimination  of  carrier  overflow  by  fusing  to 
GaAlAs  layers  results  in  significantly  higher  To,  even  if  the 
Auger  coefficient  is  large.  Performance  comparable  to  GaAs 
may  eventually  be  possible. 

These  wafer-fused  in-plane  lasers  may  be  more  sensitive 
to  the  fused  interface  than  the  wafer-fused  VCL’s  discussed 


Fig.  6.  Optical  confinement  comparison  for  a  three  quantum-well  active 
region  with  InP  and  AlGaAs  cladding  layers. 


(a)  (b) 

Fig.  7.  Comparison  of  7o  when  carrier  overflow  is  present  or  eliminated  by 
fusing  for  (a)  three  and  (b)  six  quantum  wells. 

earlier.  In  this  case,  the  lasing  mode  will  interact  more 
heavily  with  the  fused  interface:  contamination,  doping  spikes, 
dislocations,  and  other  irregularities  at  this  interface  could 
provide  additional  scattering  losses  for  the  laser.  The  structure 
shown  in  Fig.  8  was  used  to  assess  the  extent  of  those  losses. 
The  active  region  consists  of  four  55-A  GalnAsP  quantum 
wells  with  a  compressive  strain  of  0.7%.  These  quantum  wells 
are  separated  by  10  nm  barriers  of  lattice-matched  GalnAsP 
with  a  bandgap  of  1.25  yum;  this  composition  of  GalnAsP 
also  serves  as  the  waveguide  material.  The  total  waveguide 
thickness  is  300  nm,  with  a  500-A  p-InP  layer  cap  layer.  This 
active  region  was  grown  by  MOCVD  under  conditions  similar 
to  those  discussed  earlier.  The  cladding  layer  was  grown  by 
MBE  and  consisted  of  a  50  nm  p-GaAs  layer  for  fusion, 
a  1.5-fim  Alo.7Gao.3As  cladding  layer,  and  a  1500  A  p+- 
GaAs  contact  layer.  The  fusion  conditions  were  a  pressure  of 
approximately  2.5  MPa,  a  fusion  temperature  of  630  °C,  and 
a  fusion  time  of  30  min.  For  comparison  purposes,  a  piece 
of  the  active  region  had  2  yum  of  p-InP  regrown  to  make  a 
laser  structure  with  InP  cladding.  In  addition,  this  structure 
was  taken  through  the  same  temperature  cycle  as  the  fusion 
structure.  These  structures  were  processed  into  broad-area 
devices  (50-yum  stripes)  using  standard  procedures. 

Preliminaiy  results  for  the  structure  indicate  a  higher  thresh¬ 
old  for  the  fused  laser  of  approximately  1600  A/cm2  while 
the  reference  structure  is  lower  at  610  A/cm2.  In  addition,  the 
slope  efficiency  in  the  fusion  case  is  smaller  suggesting  that 
high  internal  losses  are  the  cause  of  the  increase  in  threshold 
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Fig.  8.  Test  structure  for  the  effect  of  wafer  fusion  on  1.55-//m  in-plane 
lasers. 


current.  For  the  reference  case,  the  internal  efficiency  was 
73%  with  internal  losses  of  20  cm-1.  Temperature  dependent 
measurements  were  performed  for  both  cases,  resulting  in 
T0  =  59  K  for  the  reference  structure,  and  T0  =  29.1 
K  for  the  fusion  structure.  From  the  reference  structure’s 
results,  we  believe  that  there  are  an  additional  40  cm""1  of 
loss  in  the  fusion  structure  which  would  be  consistent  with 
the  lower  T0  values.  Work  is  currently  underway  to  identify 
the  cause  of  these  additional  losses.  One  encouraging  result 
is  that  the  quantum  efficiency  of  the  fused  laser  is  constant 
at  high  temperatures,  and  does  not  show  the  high-temperature 
reduction  common  in  InP  lasers. 

C.  Avalanche  Photodetectors 

Wafer  fusion  has  also  had  an  impact  in  the  field  of  avalanche 
photodetectors  (APD’s).  APD’s  are  designed  to  first  convert 
absorbed  light  into  an  electrical  signal,  then  amplify  this 
signal  through  avalanche  multiplication.  To  construct  a  fast, 
efficient  APD  one  must  choose  materials  with  high  optical 
absorption  coefficients  and  high  speed,  low  noise  amplification 
characteristics.  In  the  near  infrared,  important  to  optical  com¬ 
munications,  the  obvious  choice  for  an  absorber  is  InGaAs. 
For  avalanche  multiplication.  Si  is  the  semiconductor  of  choice 
due  to  the  large  disparity  in  its  electron  and  hole  ionization 
coefficients,  which  leads  to  desired  low-noise,  high-speed 
amplification. 

Efforts  to  integrate  InGaAs  with  Si  through  epitaxial  growth 
have  resulted  in  hetero-interfaces  with  large  numbers  of  thread¬ 
ing  dislocations  due  to  the  lattice  mismatch  between  the 
semiconductors.  These  threading  dislocations  produce  large 
dark  currents  in  fabricated  PIN  photodetectors  [45]  and  cause 
premature  breakdown  in  these  type  of  devices  and  prevent 
them  from  being  biased  to  the  voltages  necessary  for  avalanche 
multiplication. 

The  silicon  hetero-interface  photodetector  (SHIP)  [46]  has 
been  created  by  wafer  fusing  a  Si  substrate  to  epitaxial  InGaAs 
layers  grown  lattice  matched  to  InP.  The  fusion  is  done  in  a 
H2  atmosphere  at  a  temperature  of  650  °C,  near  the  growth 
temperature  of  the  InGaAs.  SEM  scans  of  the  fused  interface 
indicate  that  the  lattice  mismatch  between  the  two  materials 


Au/Zn  Gontact 


Illumination 


Fig.  9.  Structure  of  SHIP  detector  after  fabrication.  The  InGaAs  absorption 
layer  is  0.7-// m  thick  and  the  Si  multiplication  layer  is  0.6-// m  thick. 


Fig.  10.  Bandwidth  versus  gain  for  a  SHIP  detector  modulated  with  a  1.3 
/ tm  laser.  Points  indicate  experimental  data,  line  represents  theory. 

is  accommodated  by  edge  dislocations  that  remain  at  the 
junction  and  do  not  propagate  through  either  semiconductor. 
To  determine  the  effects  of  this  hetero-interface  on  optical 
loss  and  carrier  transport,  a  careful  study  was  done  using  a 
PIN  structure  with  undoped  Si  and  InGaAs  fused  together 
[47].  No  loss  in  quantum  efficiency  was  measured  compared 
to  InGaAs  on  InP  PIN’s  of  similar  thicknesses.  This  indicates 
the  InGaAs-Si  fused  interface  causes  no  optical  loss  and  does 
not  act  as  a  carrier  transport  barrier  in  the  detectors. 

SHIP  APD’ s  were  designed  with  doping  schemes  similar  to 
existing  separate  avalanche  and  multiplication  region  (SAM) 
APD’s.  The  electric  field  was  kept  low  in  the  absorbing 
InGaAs  layer  and  high  in  the  multiplication  layer  of  the  silicon. 
Photocurrent  gains  of  over  100  have  been  measured  for  SHIP’S 
with  thick  multiplication  regions,  and  SHIP’S  with  thinner 
regions  have  demonstrated  the  highest  gain-bandwidth-product 
(315  GHz)  for  a  near-infrared  APD  measured  to  date  [8],  more 
than  double  the  achievable  gain-bandwidth  of  InGaAs-InP 
APD’s.  Fig.  9  shows  the  structure  of  a  SHIP  APD  and  Fig.  10 
shows  its  bandwidth  as  a  function  of  photocurrent  gain. 

IV.  The  Wafer  Fused  Interface 
A.  Introduction 

The  discussions  of  Section  ID  have  highlighted  the  power 
of  the  wafer  fusion  process  in  enabling  device  optimization 
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through  integration  of  heterogeneous  materials.  Despite  these 
successes  in  device  fabrication,  there  remain  improvements 
that  can  be  made  in  the  fused  structures,  and  much  is  still  not 
understood  about  the  basic  mechanism  of  fusing  itself.  More 
complete  characterization  will  allow  us  to  understand  how 
electrically  and  optically  transmissive  we  can  make  the  fused 
interface,  how  wide  a  latitude  in  processing  parameter  space 
exists  for  reliable  fusing,  and  what  long  term  reliability  issues 
exist  for  fused  devices.  In  addition,  we  need  to  understand 
what  limitations  exist  to  fusing:  can  we  integrate  any  sets  of 
disparate  materials?  The  sections  below  discuss  some  aspects 
of  our  initial  characterization  of  the  fusing  process  and  the 
fused  interface. 


B.  Electrical  Characterization 

The  electrical  characteristics  of  fused  junctions  have  been 
made  primarily  through  current  versus  voltage  (I-V)  measure¬ 
ments  made  across  the  fused  junctions.  Several  parameters 
are  known  to  influence  the  character  of  the  junctions.  The 
cleanliness  of  the  surfaces  before  bonding,  namely,  trapping 
of  contaminants  at  the  junctions,  will  have  an  adverse  effect 
on  the  conduction  of  earners.  In  the  absence  of  surface  con¬ 
tamination,  another  parameter  that  can  influence  the  electrical 
properties  is  the  exact  matching  of  the  bonds  at  fused  junctions. 
Since  the  main  reason  for  GaAs-InP  fusion  bonding  is  to 
transcend  the  lattice  matching  requirement,  exact  bond-for- 
bond  matching  of  [100]  substrates  is  not  possible  to  realize, 
but  bonding  off-axis  cut  substrates  may  help  reduce  the  number 
of  dangling  bonds  and  dislocations. 

The  bonding  of  off-axis  substrates  has  been  investigated  for 
GalnP  on  GaAs  [48]  and  has  revealed  that  small  deviations 
from  on-axis  bonding  result  in  dramatic  increases  in  junction 
resistance.  GaAs— InP  bonding  between  different  substrate  ori¬ 
entations  also  results  in  reduced  junction  conductivity  [27]. 
Since  the  surfaces  of  different  III-V  compound  semicon¬ 
ductors  have  different  oxidation  properties  and  terminations, 
certain  surface  preparations  and  procedures  may  produce  dif¬ 
ferent  results  on  different  materials.  A  good  example  is  that 
of  [Oil]  GaP—  [001]  InGaP  in  which  there  is  a  large  difference 
in  the  V-J  characteristics  between  in-phase  and  antiphase 
bonding  [48]  whereas  for  [001]  GaAs  on  [001]  InP,  the 
difference  is  hardly  noticeable  [17]. 

The  electrical  characteristics  of  fused  junctions  critically 
depend  on  the  presence  of  potential  barriers  arising  from 
discontinuous  energy-band  lineups  and  interface  charge.  Theo¬ 
retical  considerations  predict  a  staggered  band  lineup  between 
strained  GaAs-InP  with  the  GaAs  valence  band  several  hun¬ 
dred  millielectronvolts  above  that  of  InP  [20].  However, 
no  agreement  between  theoretical  and  experimental  values 
of  band  offset  has  yet  been  reported.  Potential  barriers  at 
fused  junctions  also  originate  from  thin  oxide  layers  or  a 
high  concentration  of  surface  charge  introduced  by  surface 
contamination  before  bonding.  Oxygen  has  been  found  to  be 
the  major  contaminant,  in  addition  to  high  concentrations  of 
carbon  and  hydrogen  at  the  fused  surface.  The  SIMS  profile 
of  a  fused  GaAs— InP  epilayer  shown  in  Fig.  11  does  indeed 
reveal  the  presence  of  oxygen  at  the  interface.  This  may 


Fig.  11.  SIMS  profile  of  a  fused  GaAs-InP  epilayer. 


be  a  result  of  our  sample  preparation  procedures,  resulting 
in  high  02  incorporation  at  the  surface.  Although  the  oxide 
is  nominally  removed  prior  to  sample  transfer  to  the  fusing 
fixture,  a  ~l-nm-thick  native  may  nevertheless  form  before 
the  onset  of  the  fusing  process  itself.  Surface  oxide  layers 
and  charge  affect  both  electron  and  hole  flow,  but  because  of 
the  lower  electron  effective  mass  this  has  a  greater  impact 
on  the  holes.  For  this  reason,  n-n-junctions  are  expected 
to  perform  better  than  p-p-junctions.  Furthermore,  minority 
earner  recombination  at  the  junction  has  not  been  investigated, 
to  date.  Most  devices  employ  fused  junctions  in  regions  with 
only  majority  carrier  flow.  It  is  not  yet  clear  to  what  extent 
the  dopant  species  and  its  diffusion  across  the  fused  junction 
can  improve  the  electrical  characteristics. 

C.  Optical  Characterization 

In  spite  of  evidence  of  organic  contaminants,  misfit  dislo¬ 
cations,  and  higher  than  ideal  voltage  drops  across  the  fused 
junctions,  wafer  fused  technology  has  enabled  the  realization 
of  high  quality  interfaces  for  optoelectronic  design.  In  order  to 
assess  the  extent  over  which  the  highly  dislocated  and  contam¬ 
inated  interface  impacts  the  optical  quality  of  the  surrounding 
material,  multiple-quantum-well  (MQW)  photoluminescence 
(PL)  studies  were  performed.  The  premise  of  such  experiments 
is  simple;  the  MQW  structure  acts  as  a  depth  resolved  probe 
to  compare  the  luminescence  efficiency  of  the  material  in  the 
interface  region  before  and  after  fusion. 

Lattice  matched  InGaAs  and  GaAs  quantum  wells  of  vary¬ 
ing  widths  were  therefore  strategically  grown  on  InP  and 
GaAs  substrates  by  MOCVD  and  MBE,  respectively,  to  probe 
the  optical  quality  of  the  epilayer  region  before  and  after 
wafer  fusion;  the  structures  are  shown  in  Fig.  12.  A  reference 
quantum  well  was  grown  deep  in  each  epilayer  to  facilitate 
accurate  normalization,  4160  A  from  the  surface  in  the  InP 
sample  and  6200  A  from  the  surface  in  the  GaAs  sample. 
The  location  of  these  reference  quantum  wells  should  be  far 
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Fig.  12.  MQW  structures  for  PL  studies. 

enough  from  the  subsequently  fused  interface  to  ensure  that 
their  photoluminescence  is  not  affected  by  the  fusing  process. 
The  InP  MQW  sample  was  fused  to  a  1000-A-thick  GaAs 
epilayer  grown  by  MBE;  the  GaAs  MQW  sample  was  fused 
to  a  3000-A  thick  InP  epilayer  grown  by  MOCVD.  PL  data 
was  taken  at  10  K  with  a  514-nm  Argon  5W  laser  at  an  optical 
power  of  5  mW.  Our  current  analysis  employs  three  PL  data 
sets:  PL  of  the  “as  grown”  material  to  provide  a  baseline,  after 
fusion  upon  removal  of  the  fused  epilayer  by  wet  etching,  and 
finally  after  temperature  cycling  the  MQW  substrate  under 
the  fusing  conditions  as  described  in  Section  II  to  decouple 
the  effects  of  the  temperature  cycling  employed  in  the  fusion 
process  from  the  effects  of  fusion  itself. 

The  resulting  PL  spectra  are  shown  in  Fig.  13.  Fig.  13(a) 
shows  the  raw  PL  data  taken  on  the  InGaAs-InP  MQW 
sample,  including  the  baseline  (“as  grown”),  the  temperature 
cycled,  and  the  fused  spectra.  The  three  spectra  have  not  been 
normalized  to  a  particular  quantum  well  but  are  based  on  ab¬ 
solute  measured  luminescence  based  on  the  same  laser  power. 
We  see  no  appreciable  decrease  in  luminescence  efficiency  of 
the  quantum  wells  as  close  as  500  A  from  the  fused  interface. 
The  same  is  true  for  the  GaAs  MQW  structure  as  seen  in 
Fig.  13(b).  However,  in  this  case,  we  have  normalized  to  a 
reference  peak  at  7960  nm  and  demonstrated  what  appears 
to  be  an  increase  in  the  luminescence  efficiency  of  all  the 
quantum  wells,  venturing  as  close  as  350  A  to  the  fused 
interface.  We  currently  attribute  this  increase  in  luminescence 
intensity  to  the  fused  junction  acting  as  a  gettering  source  for 
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Fig.  13.  PL  data,  before  and  after  fusing  for  (a)  InGaAs  MQW’s  and  (b) 
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Fig.  14.  Wavelength  shifts  of  InGaAs  MQW’s  after  fusing  and  after  tem¬ 
perature  cycling. 

defects,  a  process  much  akin  to  the  gettering  of  defects  at  any 
strained  interface. 

PL  data  on  peak  position,  in  addition  to  peak  intensity,  pro¬ 
vides  information  on  the  subtle  effects  of  strain  and  materials 
interaction.  While  there  are  no  peak  shifts,  within  measurement 
error,  after  the  fusing  or  temperature-cycling  process  for  the 
GaAs  MQW  samples,  there  are  slight  shifts  in  the  InP-based 
samples.  This  is  perhaps  consistent  with  the  presumed  greater 
mobility  of  the  In;  while  temperature  cycling  alone  produces 
a  blue  shift  in  all  quantum-well  peaks,  shown  in  Fig.  14, 
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the  actual  fusing  process  introduces  a  competing  redshift 
increasing  pronounced  for  those  quantum  wells  closest  to  the 
fused  interface.  More  work  is  required  to  fully  decouple  the 
effects  of  strain,  strain  relaxation,  compositional  intermixing, 
and  defect  passivation  introduced  through  the  fusing  process. 
Nevertheless,  these  initial  low  temperature  PL  measurements 
provide  an  optimistic  prognosis  for  the  optical  quality  of  the 
fused  interface,  and  the  material  immediately  adjacent  to  that 
interface. 


V.  Conclusion 

Fusing  frees  the  device  designer  from  the  constraints  of 
lattice  matched  growth  or  critical  thicknesses  of  strained  lay¬ 
ers.  Although  heteroepitaxy  results  in  threading  dislocations, 
fusing  dissimilar  lattice  constant  materials  results  in  interfaces 
with  edge  dislocations  instead  of  threading  dislocations.  The 
device  results  to  date  show  dramatic  improvements  in  telecom¬ 
munication  VCSEL’s,  in  telecommunication  APD’s,  and  in 
visible  LED's.  Expansion  of  these  techniques  to  II- VI  and  IV 
materials  will  be  very  interesting  in  the  next  few  years.  These 
techniques  are  ideal  for  optoelectronic  integration  and  for  new 
devices  that  have  not  yet  been  considered. 
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A  vertical  directional  coupler  fabricated  using  wafer  fusion  is  demonstrated  with  a  very  short 
coupling  length  of  62  pm.  The  optical  propagation  loss  introduced  by  the  fused  layer  is  investigated. 
An  excess  loss  of  1.1  dB/cm  at  1.55  pm  was  measured  for  waveguides  which  incorporate  a  fused 
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Compact  semiconductor  optical  waveguide  switches  are 
critical  components  in  photonic  integrated  circuits  for  high¬ 
speed  optical  communication  networks  and  optical  informa¬ 
tion  processing.  A  large-scale  switching  array  requires  low 
space  consumption,  so  it  is  essential  to  minimize  the  length 
of  each  switch.  Conventional  directional  couplers  with  later¬ 
ally  arranged  waveguides1  cannot  achieve  very  short  cou¬ 
pling  lengths  because  of  low  modal  overlap  and  because  of 
technological  limits  to  obtaining  uniform  and  small  gap  lay¬ 
ers.  Vertical  directional  couplers,  on  the  other  hand,  can  offer 
a  short  coupling  length  which  is  even  smaller  than  100 
pm.—  The  difficulty  of  separating  the  two  vertical  coupled 
waveguides  into  two  distinct  inputs  and  outputs  limits  the 
application  of  these  couplers  to  large-scale  switching  arrays. 
In  this  letter,  we  demonstrate  a  fused  vertical  coupler  (FVC) 
with  a  very  short  coupling  length,  which  can  solve  this  prob¬ 
lem. 

Wafer  fusion  is  a  powerful  technique  to  fabricate  struc¬ 
tures  that  cannot  be  realized  by  conventional  epitaxial 
growth  and  processing.  In  addition  to  the  inherent  advantage 
of  combining  material  with  different  lattice  constants,4  wafer 
fusion  can  give  an  extra  degree  of  freedom  in  the  design  and 
fabrication  of  three-dimensional  (3D)  photonic  devices.5  By 
displacing  the  input  and  output  waveguides  vertically  in  dif¬ 
ferent  planes,  scaling  of  switch  arrays  to  large  sizes  is  easy  to 
realize  (Fig.  1).  In  order  to  switch  a  large  number  of  input 
waveguides,  it  is  essential  to  have  compact,  high  extinction 
ratio  fused  vertical  couplers.  In  the  following,  we  will  ana¬ 
lyze  theoretically  and  experimentally  straight  fused  vertical 
couplers. 

The  structure  of  the  FVC  is  shown  in  Fig.  2.  The  mate¬ 
rial  was  grown  by  metal-organic  chemical-vapor  deposition 
and  consisted  of  a  0.5  pm  InGaAsP  (X=1.3  pm)  guiding 
layer  on  an  InP  substrate,  followed  by  a  0.1  pm  InP  cladding 
layer,  a  20  nm  InGaAsP  (X  =  1.15  ^m)  etch-stop  layer,  and  a 
0.4  pm  InP  coupling  layer.  To  fabricate  the  vertical  coupler, 
two  8  X  10  mm-  samples  are  cleaved  from  the  grown  wafer. 
In  the  first  sample,  the  top  0.4  pm  InP  layer  is  removed.  On 
the  second  sample,  a  ridge  waveguide  (WG)  structure  is  fab¬ 
ricated  using  standard  photolithography  and  selective  wet 
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etching.  The  ridges  have  3-6  pm  width,  0.4  pm  height,  and 
they  are  separated  by  125  pm.  The  two  samples  are  then 
fused  together  at  a  temperature  of  630  °C  in  a  hydrogen  at¬ 
mosphere  for  30  min.  Figure  1  shows  the  stain  etched  scan¬ 
ning  electron  microscope  (SEM)  picture  of  a  finished  FVC. 
The  fused  interface  is  not  visible,  even  after  staining.  This  is 
an  indication  of  the  high  quality  of  the  fusion  process.  There 
is  mass  transport  at  the  edge  of  the  ridge.  This  is  beneficial  in 
obtaining  a  symmetric  coupler  and  improves  the  sidewall 
flatness. 

The  near-field  pattern  at  the  output  of  the  FVCs  is  re¬ 
corded  by  an  IR  camera  and  is  shown  in  Fig.  3.  The  light  is 
the  input  from  a  8  pm  diam  single-mode  fiber.  The  total 
length  of  the  FVC  is  5.5  mm.  It  can  be  seen  that  by  changing 
the  input  wavelength,  the  light  is  switched  from  the  upper  to 
the  lower  waveguide.  Since  the  shapes  of  the  two  waveguide 
modes  are  very  similar,  one  can  obtain  a  high  extinction 
ratio.  Our  measurement  shows  the  extinction  ratio  is  about 
15  dB.  This  is  particularly  difficult  to  achieve  in  conven¬ 
tional  high  mesa  vertical  couplers.2  BPM6  simulations  show 
that  extinction  ratios  up  to  20—35  dB  are  possible.  Figure  4 
shows  the  intensities  of  the  upper  and  lower  waveguides  as  a 
function  of  wavelength.  From  the  oscillation  period  (about 
11.5  nm),  and  considering  material  and  waveguide  disper- 


FIG.  1.4X4  crossbar  switch  array  based  on  vertical  coupling  between  two 
substrates. 
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FIG.  2.  The  stain  etched  SEM  picture  of  a  fused  vertical  coupler. 


sions,  the  index  difference  between  the  even  and  odd  modes 
can  be  calculated,  which  is  0.0124.  The  corresponding  cou¬ 
pling  length  is  62  /xm  at  1.55  /zm,  which  agrees  well  with  the 
theoretical  value  of  58  /xm,  derived  from  3D  BPM  calcula¬ 
tions.  One  important  advantage  of  this  strong  vertical  cou¬ 
pling  is  the  capability  of  attaining  highly  selective  WDM 
add/drop  multiplexers  by  making  longer  couplers  with  sig¬ 
nificant  overcoupling.  Figure  4  illustrates  the  possibility  of  a 
lossless  combination  or  splitting  of  two  signals  separated  by 
just  6  nm.  The  wavelength  selectivity  of  a  FVC  can  be  en¬ 
hanced  by  using  strong  asymmetry  in  the  waveguide  struc¬ 
ture. 

To  investigate  the  optical  propagation  loss  at  1.55  /xm 
due  to  the  fused  interface,  two  single-mode  ridge  waveguides 
are  fabricated.  The  first  one  has  a  0.5  /xm  InGaAsP  (1.3  /xm 
quartenary)  core  region,  a  0.2  /xm  InP  slab  layer,  and  a  0.6 
/xm  InP  ridge  height.  The  second  sample  is  identical  to  the 
first  except  for  a  fused  interface  in  the  middle  of  the  InP  slab 


Intensity 


FIG.  3.  Photograph  of  the  near-field  pattern  at  1530,  1533,  and  1536  nm. 
The  width  of  the  ridge  is  3  yum,  and  the  distance  between  the  upper  and  the 
lower  WGs  is  1.1  /xm.  The  left  curve  is  the  profile  of  the  near  field. 
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FIG.  4.  Measured  intensity  of  the  upper  (closed  circle)  and  lower  (open 
circle)  waveguides  as  a  function  of  wavelength. 


layer,  where  the  mode  field  is  as  high  as  50%  of  the  maxi¬ 
mum.  A  Fabiy-Perot  resonance  technique  is  used  to  measure 
the  optical  loss.7  The  second  sample  shows  only  1.1  dB/cm 
excess  loss  due  to  the  presence  of  the  fused  interface  com¬ 
pared  with  the  unfused  single-mode  waveguides.  We  believe 
it  can  be  reduced  further  with  the  improvement  of  the  fusion 
process. 

The  issue  of  uniformity  for  fused  waveguides  is  very 
important  for  large-scale  monolithic  integration.  For  the 
present  size  of  our  fused  wafer  (8  X 10  mm2),  we  found  very 
good  uniformity  in  terms  of  excess  loss  or  coupling  length. 
More  than  90%  of  the  waveguides  can  work  well,  even  for 
the  ones  near  the  edge  of  the  fused  wafers.  We  have  also 
checked  the  multimode  behavior  of  the  150  /zm  width  slab 
fused  waveguides.  They  do  not  show  any  dark  spots  or  dead 
regions. 

In  conclusion,  a  fused  vertical  coupler  was  demonstrated 
with  a  62  /xm  coupling  length.  It  is  shown  that  1.1  dB/cm 
excess  optical  loss  is  introduced  due  to  the  fusion  process, 
which  is  even  lower  than  the  loss  caused  by  doping.  The 
fused  vertical  couplers  and  waveguides  give  us  an  added 
advantage  of  vertical  dimension  by  separating  the  input  and 
output  waveguides  to  realize  compact  and  scalable  3D  direc¬ 
tional  coupler  structures. 
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Various  means  of  increasing  the  extinction  ratio  in  ultra  short  directional  couplers  using  slight  asymmetry  in  the  coupled 
waveguide  structure  are  investigated.  It  is  shown  that  couplers  with  10-200  ,im  coupling  length  and  with  high  extinction  ratios 
larger  than  30  dB  can  be  achieved.  Shorter  asymmetric  couplers  have  an  extinction  ratio  that  is  more  fabrication  tolerant. 
KEYWORDS:  directional  coupler,  coupled  mode  theory,  extinction  ratio,  asymmetric  couplers,  vertical  couplers 


1.  Introduction 

Directional  couplers  are  critical  components  in  photonic 
integrated  circuits  used  in  optical  communication  systems. 
The  conventional  directional  couplers  with  laterally  arranged 
waveguides  can  not  achieve  very  short  coupling  lengths  due 
to  extreme  sensitivity  to  fabrication  variations  and  limitations 
to  produce  uniformly  very  narrow  gap  layers.1'  Vertical  direc¬ 
tional  couplers,  however,  can  obtain  a  short  coupling  length 
that  is  smaller  than  100  The  difficulty  of  separating 

the  two  vertically  coupled  waveguides  into  two  distinct  inputs 
and  outputs  limits  the  application  of  these  devices.3)  Recently, 
a  novel  fused  vertical  coupler  (FVC)  with  a  very  short  cou¬ 
pling  length  of  62  fim  was  demonstrated,  that  can  solve  this 
problem.4'  To  use  FVC  in  large  switching  fabrics,  it  should 
have  a  low  crosstalk.  Ultra  short  directional  couplers  have 
an  inherent  limitation  in  their  extinction  ratio  due  to  non¬ 
orthogonality  of  individual  waveguide  modes.5'  In  this  paper, 
we  analyze  various  means  to  improve  the  extinction  ratio  us¬ 
ing  slight  asymmetry  in  the  structure.  We  also  study  the  fab¬ 
rication  tolerance  needed  to  maintain  these  improved  charac- 
teriatics.  Since  vertical  coupling  through  the  ridge  structure 
defined  by  etch-stopping  techniques  is  much  less  sensitive  to 
the  ridge  waveguide  width  and  sidewall  smoothness  than  the 
planar  waveguide  couplers,®  it  is  expected  that  these  slight 
asymmetric  structures  can  be  better  realized  in  fused  vertical 
coupler  configuration. 


(a) 

^ca 


ncb 


2.  Model  Used  in  the  Analysis 

Figure  1(a)  shows  the  FVC  with  separated  input  and  out¬ 
put  waveguides.  Since  the  two  waveguides  are  brought  to¬ 
gether  with  an  air  gap  except  the  interaction  region,  we  could 
minimize  unwanted  couplings  for  the  separation  of  the  output 
of  the  FVC.  The  two-dimensional  index  profile  of  the  FVC 
is  reduced  to  one  dimension  using  effective  index  method. 
The  schematic  diagram  of  one  dimension  index  profile  in  the 
straight  coupling  region  is  shown  in  Fig.  1(b).  The  structure  is 
composed  of  two  waveguides  A  and  B  with  effective  indices 
/?a  and  nby  and  thicknesses  d a  and  d b,  separated  by  an  inner 
cladding  region,  index  nci  and  thickness  t.  The  outer  cladding 
regions  have  indices  nca  and  nc b.  Since  the  two  waveguides 
are  strongly  coupled  (t  =  0.2-1  p m  separation),  an  improved 
coupled  mode  theory  (ICMT)  is  used  to  model  accurately  the 
crosstalk.  This  theory  takes  into  account  explicitly  the  finite 
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(b) 

Fig.  1.  (a)  Fused  vertical  coupler  with  separated  input  and  output  waveg¬ 
uides.  (b)  Schematic  diagram  of  one  dimension  index  profile  in  the  straight 
interaction  region. 

overlap  integral  between  modes  of  individual  waveguides.7^ 
Assuming  that  the  power  is  incident  into  the  waveguide  A, 
without  the  loss  of  generality,  the  extinction  ratio  after  a  dis¬ 
tance  equal  to  the  coupling  length  is  defined  as  pb/ p&  where 
P&  Pb  arc  the  guided  mode  powers  in  waveguides  A  and 
B  respectively.  The  wavelength  used  in  the  calculation  is 
1.55  pm. 

3.  Results 

Figures  2  and  3  show  the  extinction  ratio  and  coupling 
length  of  TE  and  TM  modes  as  a  function  of  the  refractive  in¬ 
dex  of  waveguide  A  using  ICMT  and  also  2D  finite  difference 
beam  propagation  method  (BPM).  The  parameter  values  used 
in  the  calculation  are  nh  =  3.37,  wca  =  ncb  =  nci  =  3.17, 
<4  =  4=  0.5  pm,  and  /  =  0.6  pm.  We  can  see  that  the  data 
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Fig.  2.  Extinction  ratio  for  TE  and  TM  modes  as  a  function  of  the  index  of 
waveguide  A  calculated  using  ICMT  and  BPM. 
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Fig.  3.  Coupling  length  for  TE  and  TM  modes  as  a  function  of  the  index 
of  waveguide  A  calculated  using  ICMT  and  BPM. 


calculated  by  ICMT  agrees  very  well  to  that  by  BPM.  The  ex¬ 
tinction  ratio  larger  than  50  dB  for  TE  (TM)  mode  occurs  at 
na  =  3.367  (na  =  3.366)  and  the  coupling  length  in  this  case 
is  51  fxm  (47  fim  ).  The  extinction  ratio  and  the  coupling 
length  in  the  symmetric  case  for  TE  (TM)  mode  are  16.4  dB 
(15.7  dB)  and  52  fim  (48  /xm)  respectively.  We  can  see  that 
the  extinction  ratio  is  increased  considerably  at  the  expense  of 
having  less  than  100%  power  tranfer  to  the  waveguide  B  by 
slight  detuning  of  the  two  waveguide  eigenmodes  as  Chuang 
showed  in  ref.  10.  The  coupling  length  is  not  significantly 
affected  by  making  the  waveguides  asymmetric. 

Figure  4  shows  the  optimum  asymmetry  in  the  index  of 
waveguide  A  («a)  for  TE  and  TM  modes  as  a  function  of 
waveguide  separation  with  the  parameters  used  in  Figs.  2  and 
3.  It  also  displays  the  percent  tolerance  (Ana/«a)  to  obtain  an 
extinction  ratio  larger  than  30  dB.  One  can  see  that  the  opti¬ 
mum  value  of  7za  for  TE  mode  is  larger  than  that  of  TM  mode, 
while  the  percent  tolerance  of  TM  mode  is  larger  than  that  of 
TE  mode.  When  the  waveguides  are  separated  by  0.6  /xm,  the 
asymmetry  defined  by  «b  —  na  required  to  achieve  the  highest 
extinction  ratio  for  TE  (TM)  mode  is  0.003  (0.004).  Having 
index  of  waveguide  A  within  ±  0.019  %  (±  0.021  %)  of  the 
optimum  value  for  TE  (TM)  mode,  one  can  achieve  the  ex¬ 
tinction  ratio  larger  than  30  dB.  As  the  separation  between 
the  two  waveguides  decreases,  more  asymmetry'  is  needed 
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Fig.  4.  Index  of  waveguide  A  for  maximum  extinction  ratio  and  tolerance 
to  achieve  larger  than  30  dB  extinction  ratio  as  a  function  of  waveguide 
separation. 
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Fig.  5.  Coupling  length  of  the  asymmetric  coupler  for  the  different  values 
of  na  corresponding  to  Fig.  4  and  its  percent  difference  with  respect  to  the 
symmetric  case  as  a  function  of  waveguide  separation. 


but  the  tolerance  to  obtain  an  extinction  ratio  larger  than  30 
dB  increases  to  0.24  %  (0.27%).  This  means  that  the  peak 
in  extinction  ratio  in  Fig.  2  becomes  broader  as  the  coupling 
length  decreases.  This  facilitates  the  fabrication  of  passive  ul¬ 
tra  short  asymmetric  couplers  or  setting  the  voltage  or  current 
in  active  components. 

Figure  5  shows  the  coupling  length  of  the  optimum  asym¬ 
metric  coupler  for  the  different  values  of  «a  corresponding  to 
Fig.  4  and  its  percent  difference  with  respect  to  the  the  sym¬ 
metric  case  as  a  function  of  waveguide  separation.  The  cou¬ 
pling  length  decreases  and  the  percent  difference  increases 
as  the  separation  decreases.  One  can  see  that,  as  expected, 
the  coupling  length  of  TM  mode  is  shorter  than  that  of  TE 
mode.  By  making  the  waveguides  very  close  to  each  other 
(<  0.4  fim)  one  can  achieve  ultra  short  coupling  lengths  less 
than  30  \x m  and  at  the  same  time  maintain  the  extinction  ratio 
larger  than  30  dB. 

In  the  case  of  symmetric  couplers,  even  though  the  cou¬ 
pling  lengths  are  comparable  to  those  of  asymmetric  ones  (see 
Fig.  5),  the  extinction  ratios  are  severely  deteriorated  to  less 
than  15  dB  when  the  separation  between  the  two  waveguides 
is  less  than  0.55  fim  (see  Fig.  6). 

To  confirm  that  the  effect  of  the  asymmetry  on  the  extinc¬ 
tion  ratio  comes  from  the  slight  difference  in  the  shapes  of 
the  two  waveguide  eigenmodes,  the  extinction  ratio  and  cou¬ 
pling  length  of  TE  and  TM  modes  as  a  function  of  the  width 
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Fig.  6.  Extinction  ratio  for  TE  and  TM  modes  for  the  symmetric  coupler 
as  a  function  of  waveguide  separation. 


of  waveguide  A  rather  than  its  refractive  index  are  calculated 
with  the  parameter  values  na  =  nb  =  3.37,  «ca  =  ncb  =  nci  = 
3.17,  4  =  0.5  ia, m,  and  t  =  0.6  fim.  An  extinction  ratio 
larger  than  42  dB  for  TE  (TM)  mode  occurs  at  4  =  0.486  n m 
(0.484  fu,m)  while  the  extinction  ratio  in  the  symmetric  case 
was  around  16  dB.  The  effect  of  inner  cladding  layer  on  the 
characteristics  of  the  coupler  is  also  studied.  Figure  7  shows 
the  extinction  ratio  of  TE  and  TM  modes  as  a  function  of  the 
refractive  index  of  inner  cladding  layer  for  the  symmetric  and 
asymmetric  structures.  The  parameter  values  used  in  the  cal¬ 
culation  are  nb  =  3.37,  nca  =  ncb  =  3.17,  <4  =  4  =  0.5  fi m, 
t  =  0.6  fim,  na  =  3.367  for  the  asymmetric  structure  and 
na  =  3.37  for  the  symmetric  one.  The  extinction  ratio  for  the 
symmetric  structure  does  not  change  much  as  a  function  of 
the  refractive  index  of  the  inner  cladding  layer.  It  increases 
slightly  as  the  inner  cladding  layer  index  decreases  due  to 
smaller  overlap  integral  between  two  individual  waveguide 
modes.  For  the  asymmetric  structure,  the  extinction  ratio 
changes  considerably  (  >  40dB)  as  a  function  of  the  inner 
cladding  layer  index.  The  highest  extinction  ratio  occurs  at 
nci  =  3.169  for  TE  mode  and  at  nci  =  3.157  for  TM  mode. 
A  slight  asymmetry  can  equalize  the  overlap  integral  of  one 
of  the  waveguide  modes  with  the  symmetric  and  antisymmet¬ 
ric  supermodes  of  the  coupler  and  thus  increase  the  extinction 
ratio.5) 

4.  Conclusions 

We  have  shown  that  one  can  achieve  ultra  short  vertical  di¬ 
rectional  couplers  (coupling  length  10-200  fim)  while  main¬ 


3.15  3.155  3.16  3.165  3.17  3.175  3.18  3.185  3.19 
Inner  Cladding  Layer  Index  (nci) 

Fig.  7.  Extinction  ratio  for  TE  and  TM  modes  for  the  asymmetric  and  sym¬ 
metric  structures  as  a  function  of  refractive  index  of  inner  cladding  layer. 

taining  a  high  extinction  ratio  larger  than  30  dB  with  slight 
asymmetry  in  the  coupled  waveguide  structure.  Shorter  asym¬ 
metric  couplers  have  an  extinction  ratio  that  is  more  fabrica¬ 
tion  tolerant.  Also,  we  confirmed  that  the  high  extinction  ra¬ 
tio  of  asymmetric  stuctures  comes  from  the  slight  difference 
in  the  shapes  of  the  two  waveguide  eigenmodes  by  changing 
the  refractive  index  of  inner  cladding  layer  and  the  width  of 
one  of  the  waveguides. 
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Experimental  results  and  discussion:  The  wavelength-tunable  device 
consists  of  two  iron-based  programmable  magnets.  5.6cm  long 
and  0.26cm  in  diameter,  and  a  3.8cm  long  fibre  Bragg  grating.  We 
chose  a  magnetic  material  with  a  low  coercivily  (H(.)  value  of  85 
for  ease  of  programmability  with  low  fields.  The  magnets  and 
grating  were  packaged  inside  a  stainless  steel  tube  and  inserted 
inside  a  solenoid.  The  solenoid  is  ~10cm  long  with  5000  turns  and 
produces  a  field  of  ~1000e  with  a  current  of  ~  140  mA. 


Fig.  2  Superimposed  transmission  spectra  showing  various  wavelength 
shifts 

(i)  Onm 

(ii)  0.8nm 

(iii)  1.6nm 

(iv)  2.4nm 

(v)  3.2nm 

(vi)  4.0nm 


Fig.  3  Wavelength  shift  against  applied  field 


When  the  magnitude  of  the  pulse  current  in  the  solenoid  is 
altered,  the  magnetisations  in  the  two  magnets  change,  thus  affect¬ 
ing  the  magnetic  force  and  hence,  the  latched  fibre  grating  wave¬ 
length.  Fig.  2  shows  actual  data  of  the  programmably  altered  and 
latched  wavelength  shifts  induced.  The  transmission  spectrum  of  a 
Bragg  grating  with  initial  wavelength  at  1 549.7 nna,  on  activation 
of  the  device  with  applied  magnetic  fields  of  105,  118,  127,  138, 
and  1680e.  is  reproduciblv  shifted  by  0.8,  1.6,  2.4,  3.2  and  4.0nm, 
respectively.  These  wavelength  shifts  approximately  correspond  to 
a  one-,  two-,  three-,  four-  and  five-channel  shift,  respectively,  in  a 
100 GHz  channel  spacing  DWDM  communications  system.  The 
extent  of  wavelength  shift  changes  with  the  magnetic  field  strength 
(Fig.  3),  although  nonlinearly,  reflecting  the  magnetisation  loop 
shape.  The  wavelength  shift  occurs  relatively  quickly,  in  the  time 
frame  of  the  order  of  milliseconds. 

7  o  demonstrate  the  capability  of  our  technique,  we  built  a  simi¬ 
lar  device  with  increased  magnet  dimensions  and  magnetic  field 
intensity  for  maximum  wavelength  tunability.  Fig.  4  shows  an 
example  of  a  broad-range  wavelength  shift  induced  in  a  fibre 
Bragg  grating.  The  wavelength  shift  in  this  case  was  ~15.7nm 
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(from  1560.7  to  1 576.4 nm)  with  a  pulsed  field  of  230Oe.  This  shift 
would  correspond  to  about  a  39-channel  shift  in  a  50GHz 
DWDM  system. 


Fig.  4  Transmission  spectrum  of  fibre  Bragg  grating  showing  large  shift 
of  15. 7nm 


Conclusion:  Broad-range  strain-tuning  of  fibre  Bragg  gratings 
using  latchable  magnetic  force  has  been  demonstrated.  The  attrac¬ 
tive  magnetic  force  between  the  poles  of  two  programmable  and 
latchable  magnets  controllably  shifts  the  wavelength  of  a  Bragg 
grating  by  more  than  I5nm.  The  key  advantages  of  this  technique 
are  that  the  strain  tuning  is  latchable.  a  high  magnitude  of  wave¬ 
length  shift  is  possible,  and  the  tuning  speed  is  high.  These  novel 
tunable  fibre  gratings  can  find  use  in  a  variety  of  optical  network¬ 
ing  applications  such  as  programmable  add/drop  multiplexing  in 
DWDM  communication  systems. 
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Fused  vertical  coupler  switches 

B.  Liu,  A.  Shakier,  P.  Abraham  and  J.E.  Bowers 

Electrical  switching  of  fused  vertical  couplers  (FVCs)  is 
demonstrated.  For  a  3.5mm  long  coupler,  complete  switching  is 
observed  at  3V  reverse  bias.  This  low  voltage  is  attributed  to 
excess  current  induced  internal  heating  and  the  thermo-optic 
effect.  By  changing  the  electrode  design  and  reducing  the  excess 
current,  a  6.9mm  long  FVC  with  12V  switching  voltage  is 
demonstrated. 
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Vertical  directional  couplers  [1  -  4]  are  attractive  candidates  for 
realising  photonic  switches  and  narrowband  filters  because  of  their 
very  short  coupling  length  and  the  feasibility  of  integration  with 
other  optoelectronic  devices.  However  in  conventional  vertical 
couplers,  the  two  input  or  output  waveguides  arc  so  close  together 
that  direct  coupling  of  individual  waveguides  with  fibres  is  very 
difficult.  This  has  limited  the  practical  applications  of  vertical  cou¬ 
plers  in  fibre  optic  systems.  We  recently  proposed  a  novel  fused 
vertical  coupler  [5]  based  on  a  wafer  fusion  technique.  By  displac¬ 
ing  the  input  and  output  waveguides  vertically  in  different  planes, 
separation  of  two  strongly  coupled  waveguides  and  a  scalable 
switch  array  can  be  easily  realised.  A  passive  fused  vertical  coupler 
(FVC)  with  a  very  short  coupling  length  of  62pm  has  been  demon¬ 
strated.  In  this  Letter,  we  report  the  electrical  performance  of  a 
pin  straight  fused  vertical  coupler. 


Fig.  1  Schematic  drawing  of  fused  vertical  coupler 


Ridge  waveguide  in  centre  is  ‘protected’  during  fusion  process  by 
large  slab  regions  on  the  two  sides 


The  schematic  structure  of  the  FVC  is  shown  in  Fig.  1.  Two 
wafers  are  grown  using  MOCVD.  One  was  on  an  n~  InP  substrate 
and  the  other  on  a  substrate.  These  samples  had  a  0.5  pm 
InGaAsP  (\  =  1.3  pm)  guiding  layer,  followed  by  a  0.1pm  InP 
cladding  layer,  a  20nm  InGaAsP  =  1.15  pm)  etch  stop  layer 
and  finally  a  0.4pm  InP  coupling  layer.  All  layers  were  undoped. 
Device  fabrication  starts  by  cleaving  two  HO  x  12mm2  samples 
from  the  grown  wafers.  In  one  sample  the  top  0.4  pm  InP  layer  is 
removed.  On  another  sample,  a  ridge  waveguide  structure  along 
the  [110]  direction  is  fabricated  using  standard  photolithography 
and  selective  wet  etching  techniques.  To  support  the  nairow,  2- 
5pm  wide,  0.4pm  high  ridges  during  the  fusion  process,  10pm 
wide  InP  layers  were  etched  on  both  sides  of  the  ridges  as  shown 
in  Fig.  1.  The  adjacent  ridge  waveguides  are  separated  by  125pm. 
The  two  samples  are  then  fused  together  at  a  temperature  of 
630°C  in  a  hydrogen  atmosphere  for  30min.  After  fusion,  the  sam¬ 
ple  was  thinned  to  200pm  using  an  HC1  etchant  300nm  of  gold 
was  then  deposited  on  both  sides  for  applying  the  bias  voltage. 


•10-8-6-4-2-0  2  4 
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Fig.  2  l-V  curves  of  pin-FVC  with  broad  and  narrow  ohmic  contacts 

i)  broad  Ohmic  contact 

ii)  narrow  Ohmic  contact 


Fig.  2(0  shows  the  measured  I-V  curves  of  the  FVCs.  The  sam¬ 
ple  size  is  3.5  x  4.5mm2  Since  80%  of  the  area  of  two  samples  are 
fused  together,  and  only  3%  of  this  area  is  the  actual  ridge 
waveguide,  there  is  a  large  leakage  current  that  can  be  reduced  by 
etching  mesas  and  depositing  metal  only  on  the  FVC  ridge 
regions.  When  the  wafer  fusion  technique  is  used  to  fabricate 
VCSELs  and  detectors  [6],  those  devices  are  relatively  small  and 
uniformity  of  the  fused  material  is  not  so  critical  for  individual 
device  operation.  To  make  long  waveguide  couplers  and  switches, 
on  the  other  hand,  good  uniformity  of  the  fusion  interface  is 
required.  We  used  electroluminescence  images  of  the  FVC  to  study 
the  fusion  uniformity  under  current  flow.  The  near  field  pattern  at 
the  output  facet  of  the  coupler  is  observed  by  an  IR  camera  with  a 
x80  objective  lens.  Fig.  3  shows  the  luminescence  image  of  the 
fused  area  at  200mA  forward  current.  The  image  area  is  64pm 
wide.  Luminescence  from  the  two  quartan  cry  layers  can  be  clearly 
distinguished.  There  arc  no  dark  areas  and  the  intensity  is  very 
uniform  along  the  fused  interface. 


Fig.  3  Electroluminescence  image  of  64pm  wide  fused  area  under 
200mA  current  and  intensity  distribution  of  upper  and  lower  guiding 
layers 


reverse  bias,  V 

Fig.  4  Measured  intensity  of  upper  and  lower  waveguides  against 
reverse  bias  for  broad  area  Ohmic  contact  FVC 

•  upper  waveguide 
O  lower  waveguide 


To  characterise  the  FVCs,  a  tunable  laser  is  used  to  launch  light 
at  the  input  of  the  coupler  through  an  8  pm  diameter  singlemode 
fibre.  The  near  field  images  at  the  output  of  a  3.5mm  long  FVC 
for  three  reverse  biases  0,  1.8  and  3  V  are  shown  in  Fig.  4,  along 
with  the  normalised  intensities  of  the  upper  and  lower  waveguides. 
The  linear  electro-optic  effect  at  3  V  is  too  small  to  explain  the 
switching.  We  believe  that  the  thermo-optic  effect  plays  a  major 
role  in  this  device  because  of  the  high  leakage  current  which  con¬ 
tributes  to  internal  heating  of  this  structure.  To  confirm  this,  we 
changed  the  stage  temperature,  and  switching  is  observed  when 
30  °C  temperature  change.  To  reduce  the  leakage  current,  we  fabri¬ 
cated  another  FVC.  In  this  case,  one  of  the  InP  substrates  is 
removed  and  the  electrodes  are  evaporated  on  the  exact  ridge  area 
through  narrow  windows  of  an  SiN  insulation  layer.  The  I-V 
curve  of  a  7mm  x  3pm  FVC  with  this  modification  is  shown  in 
Fig.  2(h).  As  can  be  seen  in  Fig.  5,  a  12  V  reverse  bias  is  needed  to 
achieve  switching.  The  insert  pictures  show  the  near  fields  at  dif¬ 
ferent  biases. 
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In  conclusion,  the  switching  of  fused  vertical  couplers  has  been 
demonstrated.  For  bulk  InGaAsP  materials,  a  12V  reverse  bias  is 
needed  to  realise  the  switching  for  a  6.9mm  long  device.  Using 
quantum  well  structures,  the  switching  voltage  can  be  further 
reduced.  The  wafer  fusion  technique  can  be  used  to  fabricate  verti¬ 
cal  couplers  and  3D  photonic  integrated  circuits  without  requiring 
complicated  regrowths. 
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|  upper  waveguide 
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reverse  bias,  V 

Fig.  5  Measured  intensity  of  upper  and  lower  waveguides  against  reverse 
bias  for  narrow  area  Ohmic  contact  FVC 

#  upper  waveguide 
0  lower  waveguide 


Acknowledgments:  This  work  is  supported  by  the  AFOSR  and 
DARPA  center  MOST. 

©  IEE  1998  4  September  1998 

Electronics  Letters  Online  No:  1998 1468 

B.  Liu,  A.  Shakier,  P.  Abraham  and  J.E.  Bowers  ( Department  of 
Electrical  and  Computer  Engineering,  University  of  California .  Santa 
Barbara,  C A  93106.  USA) 

E-mail:  liu^opto. ucsb.edu 

A.  Shakier:  Permanent  address:  Jack  Baskin  School  of  Engineering, 
University  of  California,  Santa  Cruz,  CA  95064.  USA 


References 

1  k.ohtoku.  m.,  baba,  s.,  arai.  s.,  and  SUEMATSU.  Y.:  ‘Switching 
operation  in  a  GalnAs-TnP  MQW  integrated -twin-guide  (ITG) 
optical  switch’,  IEEE  Photonics  Technol.  Lett.,  1991,  3,  (3),  pp. 
225-226 

2  DOLLINGER.  F.,  BORCKE,  M.,  BOHM.  G..  TRANKLE.  G.,  and 
weimann,  G.:  ‘Ultra short  low-loss  optical  multiquantum-well 
GaAs/GaAlAs  vertical  directional  coupler  switch’.  Electron.  Lett., 
1996,32,06),  pp.  1509-1510 

3  ALFERNESS.  R.C.,  BUHL.  L.L.,  KOREN,  U.,  YOUNG.  M.G.,  KOCIL  T.L., 
burrus.  C.A.,  and  raybon.  g:  ‘Broadly  tunable  InGaAsP/InP 
buried  rib  waveguide  vertical  coupler  filter,  Appi  Phys.  Lett.. 
1992,  60,  (8),  pp.  980-982 

4  CHI  WU. ,  ROLLAND.  C.,  PUETZ.  N..  BRUCE,  R.,  CH1K,  K.D.,  and  XU.  J.M.: 

'A  vertically  coupled  InGaAsP/InP  directional  coupler  filter  of 
ultranarrow  bandwidth’.  IEEE  Photonics  Technol.  Lett..  1991.  3, 

(6),  pp.  519-521 

5  BIN  LIU.  AH  SHAKOURI,  ABRAHAM.  P..  KIM.  B.G.,  JACKSON.  A. W.,  and 

bowers.  J.E.:  ‘Fused  vertical  couplers’,  Appl.  Phys.  Lett.,  1998,  72. 
(21),  pp.  2637-2638 

6  BLACK,  A.,  HAWKINS.  A.R.,  MARGALIT.  N.M..  BABIC.  D.I.. 
HOLMES.  A.L.,  Jr.,  CHANG.  Y.L.,  ABRAHAM,  P  ,  BOWERS.  J.E „  and 
hu,  E.L.:  ‘Wafer  fusion:  Materials  issues  and  devices  results’,  IEEE 
J.  Scl.  Topics  Quantum  Electron..  1997,  3,  (3),  pp.  943-951 

ELECTRONICS  LETTERS  29th  October  1998  Vol.  34 


Highly  sensitive  and  compact  cross¬ 
correlator  for  measurement  of  picosecond 
pulse  transmission  characteristics  at 
1550nm  using  two-photon  absorption  in  Si 
avalanche  photodiode 

K.  Kikuchi,  F.  Futami  and  K.  Katoh 

A  highly  sensitive  and  compact  cross-correlator  in  which  an  Si 
avalanche  photodiode  (APD)  is  used  as  a  two-photon  absorber  is 
described,  which  was  developed  for  the  measurement  of 
picosecond  pulse  transmission  characteristics  at  1550nm  A 
10GHz.  Ups  pulse  under  test  is  generated  from  an  extemal- 
cavitv  modelocked  semiconductor  laser,  and  is  transmitted 
through  an  80km  long  dispersion-shifted  fibre.  A  200 fs  sampling 
pulse  is  obtained  from  the  pulse  under  test  by  pulse  compression. 
An  Si  APD  is  used  as  a  two-photon  absorber  for  the  cross- 
correlation  measurement  of  the  transmitted  pulse  waveform.  The 
asymmetric  pulse-waveform  distortion  stemming  from  the 
dispersion  slope  is  clearly  detected  by  the  system. 

Introduction:  In  future  ultra-high-speed  optical  fibre  communica¬ 
tion  systems  employing  optical  time-division  multiplexing 
(OTDM),  picosecond  or  sub-picosccond  optical  pulses  wall  be 
transmitted  through  an  optical  fibre  [1).  Hence,  there  have  been 
strong  demands  for  compact,  sensitive  and  low-cost  systems  for 
measuring  the  transmission  characteristics  of  such  ultra-short  opti¬ 
cal  pulses. 

Intensity  autocorrelators  have  been  most  widely  used  to  esti¬ 
mate  the  pulse  width  by  assuming  that  the  pulse  waveform  has  a 
Gaussian  or  sech2  profile.  However,  a  pulse  transmitted  through 
an  optical  fibre  may  be  asymmetrically  distorted  due  to  the  disper¬ 
sion  slope  and/or  the  Kerr  nonlinearity  of  the  fibre,  and  it  is  very 
difficult  to  evaluate  such  distorted  pulse  waveforms  using  intensity 
autocorrelators.  On  the  other  hand,  optical  sampling  systems  can 
more  precisely  determine  the  pulse  waveform.  However,  in  such 
systems,  we  have  to  prepare  two  sets  of  optical  pulse  sources  in 
which  the  repetition  rates  are  highly  synchronised  [2,  3].  One  pulse 
source  generates  the  optical  pulse  under  test,  and  the  other  gener¬ 
ates  the  sampling  pulse,  the  width  of  which  should  be  much 
shorter  than  that  of  the  pulse  under  test. 

In  this  Letter,  we  propose  an  alternative  approach  to  picosec¬ 
ond  pulse  waveform  measurement,  which  is  more  compact  more 
sensitive,  and  much  easier  to  handle  than  conventional  methods. 
The  proposed  system  has  three  features  that  distinguish  it  from 
the  others.  First,  a  10GHz,  1.7ps  pulse  under  test  is  generated 
from  an  external-cavity  modelocked  semiconductor  laser,  which 
ensures  much  more  stable  operation  against  environmental  effects 
than  modelocked  fibre  lasers.  Secondly,  a  200fs  sampling  pulse  is 
obtained  from  the  picosecond  pulse  under  test  by  pulse  compres¬ 
sion.  By  taking  the  cross-correlation  between  the  two  kinds  of 
pulse,  we  can  diagnose  the  pulse  waveform  under  test.  Thirdly,  we 
introduce  an  Si  avalanche  photodiode  (APD)  as  a  two-photon 
absorption  (TP A)  receiver  into  the  system.  The  combination  of  a 
nonlinear  optical  crystal  and  a  highly  sensitive  photodetecior  used 
in  conventional  autocorrelators  and  optical  sampling  systems  can 
be  replaced  by  an  Si  APD,  and  hence  the  system  construction  is 
greatly  simplified.  In  addition,  an  Si  APD  can  greatly  improve  the 
sensitivity  of  the  cross-correlator  because  the  avalanche  process 
amplifies  the  TPA-induced  photocurrenl  [3, 4], 

The  10 GHz,  1.7ps  optical  pulse  is  transmitted  through  an  80km 
long  dispersion-shifted  fibre  (DSF),  and  the  asymmetric  distortion 
of  the  pulse  waveform  stemming  from  the  dispersion  slope  pf3)  of 
the  DSF  is  clearly  detected  by  the  cross-correlator  we  have 
developed. 

Experimental  setup:  Fig.  1  shows  the  experimental  setup  for  the 
cross-correlator  used  to  measure  the  waveform  of  1550nm  picosec¬ 
ond  optical  pulses  transmitted  through  a  fibre. 

An  external-cavity  modelocked  semiconductor  laser  with  a  satu¬ 
rable  absorption  section  was  used  as  a  picosecond  pulse  source  [5]. 
When  sinusoidally  modulating  the  reverse  bias  voltage  applied  to 
the  absorption  section  at  the  cavity  resonance  frequency  around 
10GHz,  wc  could  lock  the  repetition  rate  at  the  modulation  fre¬ 
quency.  The  phase-noise  measurement  for  the  generated  pulses 
showed  that  the  timing  jitter  was  suppressed  to  <  0.5  ps.  The 
pulsewidth  (FWHM)  measured  from  the  intensity  autocorrelation 
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Wafer-Fused  Optoelectronics  for  Switching 
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Abstract — Wafer  fusion  technique  for  realization  of  compact 
waveguide  switches  and  three-dimensional  (3-D)  photonic  inte¬ 
grated  circuits  is  investigated  theoretically  and  experimentally. 
Calculations  based  on  beam  propagation  method  show  that  very 
short  vertical  directional  couplers  with  coupling  lengths  from  40 
to  220  /an  and  high  extinction  ratios  from  20  to  32  dB  can  be 
realized.  These  extinction  ratios  can  be  further  improved  using  a 
slight  asymmetry  in  waveguide  structure.  The  optical  loss  at  the 
fused  interface  is  investigated.  Comparison  of  the  transmission 
loss  in  InGaAsP-based  ridge-loaded  waveguide  structures  with 
and  without  a  fused  layer  near  the  core  region,  reveals  an  excess 
loss  of  1.1  dB/cm  at  135  //.m  wavelength.  Fused  straight  vertical 
directional  couplers  have  been  fabricated  and  characterized. 
Waveguides  separated  by  0.6  //m  gap  layer  exhibit  a  coupling 
length  of  62  //m  and  a  switching  voltage  of  about  22  V.  Impli¬ 
cations  for  GaAs-based  fused  couplers  for  850  nm  applications 
will  also  be  discussed. 

Index  Terms — Integrated  optoelectronics,  optical  couplers,  op¬ 
tical  switches,  optical  waveguide  components,  wafer  bonding. 


I.  Introduction 

MAJOR  requirements  for  optical  packet  switching  ele¬ 
ments  are  scalability,  low-loss,  and  low  crosstalk.  In 
addition,  these  structures  should  be  compatible  for  coupling 
with  fiber  ribbon  cables  and  should  incorporate  integrated 
optical  amplifiers  to  compensate  for  losses  associated  with 
fiber  loop  memories.  We  are  investigating  here  the  use  of  wafer 
fusion  to  give  an  extra  degree  of  freedom  in  the  fabrication  of 
coupled  waveguide  structures.  The  technique  of  wafer  fusion 
has  been  used  to  combine  materials  of  very  different  lattice 
constants  which  could  not  be  grown  by  heteroepitaxy  [1]. 
This  technique  can  also  be  used  to  combine  planar  waveguides 
fabricated  on  two  different  substrates  into  a  three  dimensional 
structure  in  which  there  is  vertical  coupling  between  arrays  of 
single  mode  waveguides  through  the  fused  regions  (Fig.  1)  [2]. 
In  addition,  application  of  a  bias  at  fused  areas  will  allow  a 
change  of  gain  or  index  for  switching  purposes.  Because  of  the 
differences  between  fused  vertical  couplers  and  conventional 
planar  couplers  [33— [6],  we  have  studied  theoretically  the  cou¬ 
pling  length  and  extinction  ratio  using  three-dimensional  (3-D) 
beam  propagation  method  [7].  In  order  to  gain  insight  into 
fundamental  limitations  in  the  performance  of  these  strongly 
coupled  structures,  we  have  also  used  improved  coupled  mode 
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theories  where  the  nonorthogonality  of  the  modes  of  individual 
waveguides  is  taken  into  account  explicitly.  From  a  fabrication 
point  of  view,  realization  of  vertical  couplers  requires  a 
detailed  optical  characterization  of  the  loss  and  uniformity  of 
the  fused  interface.  In  the  following,  after  discussion  of  various 
design  issues  for  fused  vertical  couplers,  their  fabrication  and 
characterization  are  described. 

n.  Beam  Progagation  Method  Analysis 

In  order  to  calculate  the  coupling  length  and  the  extinction 
ratio  in  these  two-dimensional  (2-D)-fused  waveguide  struc¬ 
tures  a  3-D  finite  difference  beam  propagation  program  is 
used.  The  fused  vertical  coupler  (FVC)  is  shown  in  Fig.  2. 
A  single-mode  ridge-loaded  waveguide  structure  based  on  InP 
substrate,  with  0.5  /zm  InGaAsP  (Agap  =  1,3  /xm)  core  region, 
0.1  (im  cladding  and  0.1  /zm  ridge  height,  is  vertically  coupled 
through  a  fused  gap  layer  to  an  identical  waveguide.  The  gap 
layer  thickness  is  varied  from  0.1  to  0.6  micron  with  its  index 
ranging  from  InP  to  InGaAsP  (Agap  =  1.4  /zm). 

Fig.  3(a)  displays  the  coupling  length  for  different  param¬ 
eters  of  the  gap  layer.  As  expected,  increasing  the  gap  layer 
index  reduces  the  coupling  length.  In  a  coupled-mode  picture, 
this  can  be  explained  by  an  increase  in  the  overlap  integral 
of  the  two  modes  of  adjacent  waveguides.  On  the  other 
hand,  the  dependence  of  the  coupling  length  on  the  gap  layer 
thickness  shows  a  mixed  behavior.  When  the  gap  region  has 
small  indexes  close  to  InP  layer,  increasing  its  thickness  will 
decouple  the  two  waveguides  and  thus  increases  the  coupling 
length.  However,  when  the  index  of  the  gap  layer  is  large 
(close  to  1.3  fim  quartenary),  the  mode  amplitude  in  this 
region  is  not  anymore  exponentially  decaying,  but  sinusoidal. 
So  a  thicker  gap  layer  will  increase  the  overlap  integral 
between  modes  of  adjacent  waveguides  and  thus  reduces  the 
coupling  length.  When  the  gap  layer  thickness  is  more  than 
0. 3-0.4  fi m,  an  analysis  based  on  the  supermodes  of  three 
coupled  waveguide  is  more  appropriate,  but  the  appearance 
of  undesirable  modes  in  the  gap  layer  will  deteriorate  the 
performance  of  the  directional  coupler. 

In  order  to  quantify  the  effect  of  higher  order  modes, 
power  transfer  between  two  waveguides  was  analyzed.  The 
eigenmode  of  one  of  the  uncoupled  waveguides  was  taken  for 
the  input  field,  and  power  transfer  to  the  other  waveguide  as  a 
function  of  propagation  distance  was  monitored  by  the  beam 
propagation  method  (BPM)  simulation.  Fig.  3(b)  displays  the 
extinction  ratio  defined  as  the  ratio  of  mode  powers  in  the 
two  waveguides  after  a  coupling  length.  When  the  gap  layer 
is  thick  and  its  index  is  high,  the  coupler  has  poor  extinction 
ratios  from  5  to  10  dB.  In  this  case  BPM  simulation  reveals  3-4 
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Fig.  I.  InP  crossbar  switch  based  on  coupling  between  independent  arrays  of  waveguides  on  each  substrate. 


inPSubstratc 


In  P  Substrate 


Fig.  2.  The  coupling  region  between  two  fused  waveguides,  and  the  param¬ 
eters  used  for  BPM  simulations  (A  =  1.55  // m,  n(InP)  =  3.17,  n(InGaAsP) 
-  3.37). 

supermodes  in  the  coupling  region.  In  addition  to  the  expected 
symmetric  and  antisymmetric  eigenmodes,  there  are  modes  of 
the  gap  layer  and  some  leaky  modes.  But  for  a  wide  range 
of  parameters  (gap  thickness  from  0.2  to  0.6  ^m,  and  gap 
index  from  3.2  to  3.4),  extinction  ratios  from  20  to  32  dB 
can  be  achieved.  From  Fig.  3(a)  we  see  that  this  corresponds 
to  coupling  lengths  of  the  order  from  40  to  220  fim.  Since 
the  two  waveguides  are  very  close,  it  is  almost  impossible  to 
excite  only  of  the  them  and  to  measure  the  extinction  ratio 
experimentally.  In  practice,  the  two  ridge  structures  will  be 
separated  by  curved  regions  and  on/off  ratio  is  limited  by 
unwanted  couplings  at  regions  where  the  waveguides  join 
together.  The  above  analysis,  however,  shows  the  inherent 
limitation  in  extinction  ratios. 

In  these  symmetric  ultra  short  couplers,  the  main  problem  to 
achieve  low  extinction  ratio  is  nonorthogonality  of  the  modes 
of  individual  waveguides  [8],  [9].  Using  a  slight  asymmetry, 
one  can  improve  extinction  ratios  to  arbitrary  small  values. 
To  see  this,  lets  consider  the  one  dimensional  index  profile  of 
the  coupling  region  of  FVC  in  Fig.  2  with  0.2  ^xm  InP  gap 
layer.  Fig.  4  shows  calculations  based  on  improved  coupled 
mode  theories,  where  corrections  due  to  overlap  integrals  are 
incorporated  [8].  BPM  simulations  agree  very  well  with  these 
results.  When  the  gap  layer  thickness  is  0.2  //m,  the  asymmetry 
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Fig.  3.  (a)  The  coupling  length  and  (b)  the  extinction  ratio  as  a  function  of 
gap  layer  index  for  different  thicknesses  of  the  gap  layer. 
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Fig.  4.  Index  of  waveguide  A  for  maximum  extinction  ratio  and  tolerance  on 
that  to  achieve  >30  dB  extinction  ratio  as  a  function  of  waveguide  separation. 


Waveguide  Width  (pm) 


Fig.  5.  Coupling  length  for  the  vertical  coupler  and  conventional  planar 
coupler  as  a  function  of  waveguide  width. 

in  core  waveguide  index  required  to  achieve  the  highest 
extinction  ratio  for  TE  (TM)  mode  is  0.003  (0.004).  Having 
the  core  index  within  ±0.019  (±0.021%)  of  the  optimum 
value  for  TE  (TM)  mode,  one  can  achieve  the  extinction 
ratio  larger  than  30  dB  (Fig.  4).  As  the  separation  between 
the  two  waveguides  decreases,  more  asymmetry  is  needed  but 
the  tolerance  to  obtain  an  extinction  ratio  larger  than  30  dB 
increases.  This  facilitates  fabrication  of  passive  asymmetric 
couplers  or  setting  the  voltage  or  current  in  active  components. 
An  intuitive  picture  is  that  a  slight  asymmetry  can  equalize 
the  overlap  integral  of  the  single  waveguide  mode  with  the 
symmetric  and  antisymmetric  supermodes  of  the  coupler  and 
thus  increase  the  extinction  ratio. 

Vertical  coupling  through  the  ridge  structure  whose  height 
is  defined  by  etch-stopping  techniques  is  much  less  sensitive 
to  the  ridge  waveguide  width  and  sidewall  smoothness  than 
the  planar  waveguide  couplers.  In  fact,  the  difficulty  in  making 
reproducibly  and  uniformly  very  narrow  gap  «1  /;m)  couplers 


Wet  Etching 


Fig.  6.  Different  steps  of  fabrication  process  for  single-mode  waveguide 
structure  with  a  fused  interface  near  core  region. 

have  mitigated  their  development  for  ultra  short  switching 
devices.  Fig.  5  shows  the  coupling  length  as  a  function  of 
waveguide  width  for  the  case  of  a  conventional  ridge-loaded 
structure  with  0.5  /xm  InGaAsP  (Agap  =  1.3  /xm)  core  layer, 
0.1  /xm  InP  slab  layer  and  0.2  /xm  InP  ridge.  The  center  of 
the  two  waveguides  are  separated  by  6  /xm.  It  can  be  seen 
that  a  change  of  1  /xm  in  waveguide  width  will  change  the 
coupling  length  by  30—40%.  When  the  same  waveguides  are 
coupled  vertically,  the  coupling  length  is  about  two  orders 
of  magnitude  smaller  and  at  the  same  time  less  sensitive  to 
waveguide  width  variation  (4—5%  change  in  coupling  length 
for  one  micron  change  in  waveguide  width). 

Another  requirement  for  optical  switches  is  polarization 
insensitivity.  The  fused  vertical  coupler  shown  in  Fig.  2  has 
coupling  length  for  TM  polarized  light  at  1.55  /xm  wavelength 
which  is  7—13%  shorter  than  the  TE  one  (for  gap  layer 
thicknesses  between  0. 1-0.6  /xm).  However,  it  is  possible  to 
make  the  switch  polarization  insensitive  using  the  difference 
in  materials  dispersion,  e.g.,  by  combining  GaAs  and  InP 
waveguides  [10]. 

in.  Optical  Loss  at  the  Fused  Interface 

The  single  mode  waveguide  structure  was  based  on 
MOCVD  grown  material  with  0.5  /xm  InGaAsP  (Agap  = 
1.3  /xm)  guiding  layer,  0.24  /xm  cladding  layer  which 
includes  two  0.1  /im  InP  layers  and  two  0.02  /x m  InGaAsP 
(Agap  =  1.15  /xm)  etching  stop  layers,  and  finally  0.6  /xm 
InP  ridge  layer.  For  the  purpose  of  comparison,  we  use  the 
same  wafer  and  single  mode  waveguide  geometry  with  and 
without  a  fused  interface  near  the  core  region.  The  control 
waveguide  has  3  /xm  wide  and  0.6  /xm  high  ridges  defined 
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Fig.  8.  The  stain-etched  SEM  picture  of  a  fused  vertical  coupler. 


using  wet-etching  techniques.  To  fabricate  the  single  mode 
fused  waveguide,  two  1  x  0.8  cm2  samples  are  cut  from  the 
MOCVD  grown  wafer.  First,  the  0.6  fim  InP  layer  and  0.1 
cladding  layer  of  one  sample  are  removed  using  selective 
wet-etching  (Fig.  6);  then  10  fim  wide,  0.6  /im  deep  channels 
with  160  /urn  spacing  are  opened  in  a  second  sample.  The  two 
samples  are  then  fused  at  630  °C  in  a  hydrogen  atmosphere  for 
30  min.  Subsequently,  InP  substrate  and  0.5  /im  guiding  layer 
of  the  top  wafer  is  removed  and  3  fxm  wide  ridge  waveguides 
are  fabricated  using  wet  etching.  Fig.  7  shows  a  stain-etched 
SEM  picture  of  finished  device.  The  fused  interface  can  not 
be  seen  in  this  picture.  This  is  an  indication  of  the  high 
quality  of  the  fused  interface.  The  presence  of  the  channels 
prior  to  fusion  is  crucial.  Without  these  channels,  we  could 
see  microscopic  voids  at  the  fused  junction  and  many  of  the 
fabricated  waveguides  did  not  show  clear  eigenmodes. 

Fabry-Perot  resonance  technique  was  used  to  measure  the 
optical  propagation  loss  [2].  The  optical  loss  of  the  unfused 
waveguide  is  about  2.4  dB/cm,  while  the  fused  structure 
showed  a  loss  of  3.5  dB/cm  at  1.55  ^m.  Since  the  geometry 
and  materials  are  identical,  the  1.1  dB/cm  excess  loss  should 
be  attributed  to  the  fused  interface.  BPM  calculations  indicate 
that  the  field  strength  at  the  center  of  the  fused  interface  is  49% 
of  the  maximum  field.  The  issue  of  waveguide  uniformity  for 
large  scale  monolithic  integration  is  very  important.  The  size 
of  our  fused  wafers  is  about  1  x  0.8  cm2.  After  thinning 
and  cleaving,  the  size  of  the  sample  for  measurement  is  about 
6x6  mm2.  The  yield  of  the  fused  waveguides  is  more  than 
90%  which  is  almost  same  as  unfused  sample.  The  existence  of 
channels  in  fused  sample  provides  150  fi m  wide  multimode 
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Fig.  9.  Measured  intensity  of  the  upper  and  lower  waveguides  as  a  function 
of  wavelength. 

slab  waveguides  that  contain  a  fused  interface.  We  did  not 
notice  any  “dark”  spot  in  these  multimode  structures. 

We  have  also  investigated  mass  transport  at  the  fused 
interface  by  comparing  samples  with  different  effective  fused 
areas.  In  conventional  fused  structures,  after  fabrication  of 
narrow  channels  on  one  of  the  wafers  prior  to  fusion,  typically 
over  90%  of  the  surface  of  the  samples  is  in  contact  during  the 
fusion  process.  We  studied  samples  where  the  fusion  was  only 
over  the  surface  on  the  top  of  waveguides  (3-6  ^m  thickness, 
separated  by  125  //m,  and  about  1  cm  long).  In  this  case  only 
4%  of  the  surface  of  the  two  wafers  is  in  contact  during  fusion. 
We  did  not  notice  any  substantial  degradation  or  nonuniformity 
in  the  ridge  waveguide  structure. 
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Fig.  10.  Measured  bias  dependence  of  the  output  light  intensity  in  upper  and 
lower  waveguides  for  a  3.5  mm  fused  vertical  switch.  The  top  pictures  are 
photographs  of  the  near  field  pattern  at  the  output  of  the  coupler  at  0  and  2.2  V. 


Fig.  11.  Asymmetric  fused  vertical  coupler. 


IV.  Fused  Vertical  Directional  Coupler  and  Switch 

The  structure  of  the  coupler  is  identical  to  that  in  Fig.  2, 
with  0.2  /im  thick  InP  gap  layer.  The  material  was  grown  by 
MOCVD  and  consisted  of  a  0.5  InGaAsP  (Agap  =  1.3  pm) 
guiding  layer  on  InP  substrate,  followed  by  0.1  /im  InP 
cladding  layer,  20  nm  InGaAsP  (Agap  =  1.15  pm)  etch  stop 
layer  and  0.4  pan  InP  coupling  layer.  To  fabricate  the  vertical 
coupler,  two  8  x  10  mm2  samples  are  cleaved  from  the 
grown  wafer.  In  the  first  sample  the  top  0.4  fx m  InP  layer  is 
removed.  On  the  second  sample,  a  ridge  waveguide  structure 
is  fabricated  using  standard  photolithography  and  selective  wet 
etching.  The  ridges  have  3-6  fim  width,  0.4  pm  height  and 
they  are  separated  by  125  (i m.  The  two  samples  are  then  fused 
together  at  a  temperature  of  630°  C  in  a  hydrogen  atmosphere 
for  30  min.  Fig.  8  shows  the  stain-etched  SEM  picture  of  a 


Index  n 

9 


(b) 

Fig.  12.  (a)  The  coupling  length  and  (b)  the  extinction  ratio  as  a  function 
of  gap  layer  index  for  different  thicknesses  of  the  gap  layer,  for  asymmetric 
fused  vertical  coupler. 

finished  fused  vertical  coupler  (FVC).  The  fused  interface  is 
not  visible,  even  after  staining.  There  is  mass  transport  at  the 
edge  of  the  ridge.  This  is  beneficial  to  get  a  symmetric  coupler 
and  improves  the  side  wall  flatness. 

The  near  field  pattern  at  the  output  of  FVC’s,  4.5  mm  long, 
is  recorded  by  an  IR  camera.  The  light  is  input  from  a  8 
/im  diameter  single  mode  1.55  /;m  fiber.  It  can  be  seen  that 
by  changing  the  input  wavelength,  light  is  switched  from  the 
upper  to  the  lower  waveguide.  Fig.  9  shows  the  intensities  of 
the  upper  and  lower  waveguides  as  a  function  of  wavelength. 
Our  measurements  show  that  the  extinction  ratio  can  be  >15 
dB.  This  is  particularly  difficult  to  achieve  in  conventional 
high  mesa  vertical  couplers  [3]-[6],  From  the  oscillation 
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Fig.  13.  SEM  pictures  of  the  separated  top  and  bottom  waveguides  for  an  asymmetric  fused  vertical  coupler.  The  bottom  picture  is  a  photograph  of  the 
near  field  pattern  at  the  output  of  the  coupler  where  the  two  waveguides  are  separated  by  250  ft  m. 


period  (about  12  nm)  and  considering  material  and  waveguide 
dispersions,  the  index  difference  between  the  even  and  odd 
modes  can  be  calculated  which  is  0.0121.  The  corresponding 
coupling  length  is  62  fxm  at  1.55  fx m,  that  agrees  very  well 
with  58  (xm  result  from  BPM  simulations.  Finally  a  PIN  FVC 
was  fabricated  and  characterized.  The  structure  was  identical 
to  the  one  in  Fig.  8,  the  only  difference  is  that  the  top  InP 
substrate  is  p-doped  and  the  bottom  wafer  n-doped.  It  can  be 
seen  in  Fig.  10  that  with  a  bias  of  2.2  V  the  light  is  switched 
from  the  top  to  the  bottom  waveguide.  Electro-optic  effect 
at  such  a  small  bias  is  not  enough  to  explain  the  switching. 
We  believe  that  thermo-optic  effect  plays  a  major  role  in  this 
device  because  of  the  high  leakage  current  and  local  heating 
of  the  coupler  [11]. 

V.  ASYMMETRIC  FUSED  VERTICAL  DIRECTIONAL  COUPLER 

In  order  to  fabricate  multilevel  3-D  photonic  integrated 
circuits,  an  asymmetric  fused  structure  is  needed  (see  Fig.  11). 
By  repeating  the  fusion  process,  one  can  obtain  multiple  layers 
of  waveguide  interconnects.  For  fabrication,  first,  a  set  of 
ridge  waveguides  on  a  InP  wafer  are  defined  using  the  usual 
wet-  and  dry-etching  techniques.  Subsequently,  a  wafer  is 
bonded  on  top  of  the  waveguides.  After  removing  the  substrate 
of  this  top  wafer  using  selective  etching,  a  second  set  of 
waveguides  is  fabricated.  These  top  waveguides  are  coupled 
vertically  to  the  waveguides  beneath  them  in  areas  where  the 
two  structures  are  connected  by  wafer  fusion.  The  issue  of 
alignment  in  the  coupling  regions  is  facilitated  using  infrared 
photolithography.  Even  though  the  top  and  bottom  waveguides 
are  very  dissimilar,  by  matching  the  modes  effective  indexes, 
one  can  theoretically  achieve  coupling  lengths  of  the  order  of 
60  fxm  with  25  dB  extinction  ratio  [see  Fig.  12(a)  and  (b)]. 


Fig.  13  displays  the  SEM  cross  section  of  the  bottom  and 
top  waveguides  along  with  the  near  field  infrared  image  at 
the  output  of  an  InGaAsP  asymmetric  coupler  where  the  two 
waveguides  are  separated  by  250  fxm. 

VI.  Prospects  for  GaAs  Fusion 

Wafer  fusion  technique  can  also  be  used  to  fabricate  vertical 
couplers  in  other  materials.  To  investigate  the  optical  loss 
due  to  fusion  process  in  AlGaAs,  a  single  mode  waveguide 
structure  similar  to  the  one  in  Fig.  7  was  fabricated.  The 
reference  structure  was  based  on  GaAs  substrate,  with  2.5 
(x m  Alo.4Gao.6As  bottom  cladding  layer,  0.2  [xm  Alo.15Gao.85As 
core  region,  0.3  /zm  Alo.4Gao.5As  slab  layer  and  0.6  fx m 
Alo.4Gao.5As  ridge.  The  waveguides  were  3  fxm  wide  and 
they  exhibited  a  loss  of  about  5  dB/cm.  This  high  loss 
was  due  to  the  poor  quality  of  the  sidewalls.  SEM  pictures 
revealed  a  roughness  on  the  order  of  one  micron.  Single  mode 
waveguides  with  a  fused  interface  in  the  middle  of  the  slab 
layer  (i.e.,  0.15  fxm  away  from  the  core  region),  had  an  excess 
loss  of  about  3  dB/cm  at  1.55  fxm  wavelength. 

It  will  be  difficult  to  achieve  ultra  short  coupling  lengths 
with  fused  vertical  couplers  at  shorter  wavelengths  (e.g.,  850 
nm).  If  the  single  mode  AlGaAs  waveguide  structure  described 
above  has  only  0. 1  fx m  ridge  height  and  is  coupled  vertically 
to  an  identical  waveguide,  the  coupling  length  at  850  nm  is 
over  500  fxm.  The  same  structure  at  1.55  \x m  wavelength  will 
have  a  coupling  length  of  about  5  fxm. 

VTL  Summary  and  Conclusion 

In  conclusion,  wafer  fusion  technique  for  fabrication  of 
vertical  couplers  and  switches  is  described.  Very  short  direc¬ 
tional  couplers  with  a  coupling  length  from  40  to  220  /zm 
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and  high  extinction  ratios  from  20  to  32  dB  can  be  realized. 
These  extinction  ratios  can  be  further  increased  using  a  slight 
asymmetry  in  waveguide  structure.  It  is  shown  that  1.1  dB/cm 
excess  optical  loss  is  introduced  due  to  fusion  process  in 
InP  based  waveguides  at  1.55  pm  wavelength.  Fused  straight 
vertical  directional  couplers  separated  by  0.6  pm  gap  layer 
exhibit  a  coupling  length  of  62  pm  and  a  low  switching 
voltage  of  about  2.2  V.  These  fused  waveguides  give  us  the 
added  advantage  of  vertical  dimension  by  separating  the  input 
and  output  waveguides  to  realize  compact  and  scalable  3-D 
photonic  integrated  circuits. 
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Abstract — A  novel  vertical  coupler  filter  based  on  fused  In- 
GaAsP-InP  and  AlGaAs-GaAs  waveguide  structures  is  proposed 
and  demonstrated.  The  large  material  dispersion  difference  be¬ 
tween  InGaAsP  and  AlGaAs  around  1.55  //m  and  similar  wave¬ 
guide  geometries  are  used  to  realize  a  less  sensitive  polarization 
and  narrow-band  filter  with  two  separated  inputs  and  outputs 
and  more  than  -40-dB  sidelobe  suppression  should  be  possible. 

Index  Terms —  Integrated  optoelectronics,  optical  directional 
couplers,  optical  waveguide  filters,  wafer  bonding. 

INTEGRATED  compact  and  narrow-band  optical  filters 
are  key  components  for  dense  wavelength  division  mul¬ 
tiplexing  (DWDM)  systems  as  add/drop  multiplexers  and 
demultiplexers.  To  date,  many  types  of  add/drop  filters  have 
been  proposed  and  realized  including  diffraction  gratings, 
arrayed- waveguide  gratings,  Mach— Zehnder  interferometers 
and  directional  couplers.  Compared  to  other  structures,  asym¬ 
metric  directional  coupler  filters  [1]— [3]  using  two  dissimilar 
waveguide  on  III-V  semiconductors  are  promising  because 
of  the  precise  control  of  waveguide  thickness  and  indexes 
during  crystal  growth  and  monolithic  integration  with  other 
devices  such  as  optical  amplifiers,  photodectors,  modulators, 
and  lasers.  There  are,  however,  several  obstacles  to  using 
these  vertical  coupler  structures  in  system  applications.  The 
characteristics  are  strongly  polarization  dependent,  launch¬ 
ing  the  light  into  and  coupling  light  out  of  two  very  close 
waveguide  are  very  difficult  and  the  coupling  efficiency  for 
two  dissimilar  waveguide  geometries  can  be  very  different. 
In  this  letter,  a  novel  vertical  coupler  filter  (VCF)  based  on 
wafer  fusion  technology  is  proposed  and  demonstrated.  This 
combines  two  different  material  systems:  AlGaAs-GaAs  with 
a  low  material  dispersion  at  1.55  m  and  InGaAsP  with  a  very 
high  material  dispersion.  With  a  proper  design,  a  narrow-band 
and  polarization  independence  filter  with  two  separated  inputs 
and  outputs  can  be  easily  realized,  which  solves  all  of  the 
above  problems. 

It  is  well  known  that  the  response  bandwidth  of  the  asym¬ 
metrical  directional  coupler  is  inversely  proportional  to  both 
the  device  length  and  the  difference  of  mode  dispersion  in 
the  two  waveguide  a  =  ( dni/dX )  —  ( drio/dX ),  where  n i  and 
ri2  are  effective  indices  of  the  two  waveguide  eigenmodes. 
To  minimize  the  device  length  and  reduce  the  sidelobes,  the 
filter  bandwidth  can  only  be  narrowed  by  increasing  a.  The 
modal  dispersion  depends  on  two  factors.  The  first  one  is 
waveguide  dispersion  that  depends  on  waveguide  geometry; 
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the  other  one  is  the  material  dispersion.  Vertical  coupler 
filters  realized  up  to  now  use  mainly  waveguide  dispersion 
difference  [l]-[3].  A  narrow  bandwidth  requires  one  of  the 
waveguide  to  have  a  very  small  index  difference  between 
the  core  and  the  cladding,  and  a  large  core  size,  while  the 
other  one  should  have  a  large  index  difference  and  a  small 
core  size.  To  keep  the  single-mode  operation  and  a  high 
coupling  efficiency  with  fibers,  the  waveguide  core  size  can 
not  be  too  large  or  too  small,  and  this  limits  the  bandwidth 
of  the  filter.  On  the  other  hand,  the  difference  in  effective 
indices  between  TE  and  TM  modes  for  a  waveguide  with 
a  large  index  difference  and  small  core  is  much  more  than 
the  TE  and  TM  difference  of  another  waveguide  with  small 
index  difference  and  large  core  size.  Consequently,  these 
devices  have  a  strong  polarization  dependence.  Generally  the 
polarization  dependent  wavelength  shift  is  more  than  30  nm 
[4],  [5]  and  that  is  a  disadvantage  in  fiber  optic  communication 
systems.  A  birefringence  compensation  technique  [5]  has  been 
used  to  solve  this  problem.  But  this  needs  a  complicated 
structure  design  and  a  critical  material  growth.  Since  the  po¬ 
larization  dependence  and  different  coupling  efficiency  come 
from  the  strong  asymmetry  of  two  waveguide  geometries, 
these  problems  can  be  solved  if  the  two  waveguide  have 
similar  structures.  For  these  waveguide  with  almost  identical 
waveguide  dispersions,  a  large  material  dispersion  difference 
between  two  waveguide  is  needed  to  realize  a  narrow-band 
polarization  independent  filter.  It  is  known  that  a  material 
has  strong  dispersion  when  the  operation  wavelength  is  near 
the  band  gap.  So  InGaAsP  material  can  have  much  higher 
dispersion  than  AlGaAs  material  around  1.55  and  1.3  fim.  For 
example,  the  material  dispersion  of  InGaAsP  (A#  =  1.45  ^m) 
at  1.55  fjm  is  -OAS/tim  [6],  which  is  almost  one  order  of 
magnitude  higher  than  that  of  Alo.1Gao.9As,  — 0.059/^m  [7]. 
Such  a  large  dispersion  difference  is  very  difficult  to  obtain  if 
one  only  uses  different  waveguide  geometries.  Unfortunately, 
because  of  the  large  lattice  mismatch,  good  quality  InP  can  not 
be  grown  on  GaAs  substrate  or  vise  versa.  Recently,  a  new 
technique  called  wafer  fusion  [8]  or  wafer  bonding  [9]  has 
been  developed  that  combines  two  materials  with  a  large  lattice 
mismatch.  Wafer  fusion  can  also  be  used  to  fabricate  three- 
dimensional  (3-D)  photonic  devices.  For  conventional  vertical 
coupler  filters,  the  difficulty  of  separating  the  two  waveguide 
limits  its  application  to  WDM  systems.  Using  wafer  fusion,  the 
two  close  inputs  and  two  outputs  can  be  easily  separated  in 
different  planes  [10],  [11].  Fig.  1(a)  is  the  schematic  drawing 
of  a  fused  InGaAsP-InP-AlGaAs-GaAs  vertical  coupler  filter 
with  separated  inputs  and  outputs. 

The  detailed  structure  of  the  proposed  fused  vertical  coupler 
filter  is  illustrated  in  Fig.  1(b).  The  upper  InGaAsP-InP  wave¬ 
guide  consists  of  a  0.4-^m  InGaAsP  (A g  =  1.45  fira )  guiding 
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(a)  (b) 

Fig-  1.  (a)  The  proposed  fused  vertical  coupler  with  separated  inputs  and 
outputs  for  add/drop  applications,  (b)  The  detailed  structure  of  fused  to- 
GaAsP-InP  and  AIGaAs-GaAs  filter. 


Fig.  2.  The  calculated  waveguide  dispersions  and  total  dispersions  of  In- 
GaAsP-InP  and  AlGaAs— GaAs  waveguide. 


Fig.  3.  The  calculated  effective  index  for  TE  and  TM  modes  The  to- 
GaAsP-toP  waveguide  has  0.4-,, m  quaternary  (A.,  =  1.37  ,,m)  core  and 
0.1-pm  InP  cladding  layer.  The  AIGaAs-GaAs  waveguide  is  similar  as  before 
except  the  core  thickness  is  0.46  fim. 


Fig.  4.^  The  response  of  an  X-crossing  fused  vertical  coupler  filter  with 
0  =  0°  and  0.16°.  ds  =  1.2  //m.  The  insert  is  the  schematic  drawing  of 
X-crossing  fused  vertical  coupler  filter. 


layer  and  an  InP  cladding  layer.  The  lower  AIGaAs-GaAs 
waveguide  includes  a  0.53  /urn  Alo.1Gao.9As  core  and  a  0.2 
jtm  Alo.5Gao.5As  cladding  layer.  Those  two  waveguide  are 
phase  matched  at  1.55  //m.  Fig.  2  shows  waveguide  and  total 
dispersions  of  3  /z m  wide  upper  InGaAsP-InP  and  lower 
AIGaAs-GaAs  waveguide  calculated  by  a  transfer  matrix 
method  with  effective  index  approximation.  The  index  data 
are  taken  from  [6]  and  [7].  The  waveguide  dispersions  of  the 
two  waveguide  are  very  small  and  almost  identical,  so  the 
material  dispersion  dominates  in  our  vertical  coupler  filter.  In 
the  current  structure,  because  of  similar  waveguide  structures, 
there  is  only  8-nm  polarization  dependent  wavelength  shift 
which  is  much  less  than  the  value  of  more  than  30  nm  in 
the  conventional  vertical  coupler  filter.  It  is  easy  to  realize 
polarization  independent  vertical  coupler  filters  by  replacing 
1.45 -/an  quaternary  with  1.37-jtm  quaternary.  This  is  shown 
in  Fig.  3.  Since  the  material  dispersion  of  1.37-mm  quaternary 
is  a  little  lower  than  that  of  1.45 -,9m  quaternary,  a  small 
bandwidth  will  be  sacrificed  in  this  structure. 

Using  a  3-D  beam  propagation  method  (BPM),  the  perfor¬ 
mance  of  fused  filters  is  simulated.  When  the  separations  of 
two  waveguide  d,  =  1.2,  1.6,  and  2  p m,  the  corresponding 
coupling  lengths  (100%  power  transfer)  are  1,  4.5  mm,  and 


2  cm  and  the  bandwidths  are  4,  0.8,  and  0.2  nm  at  the 
coupling  length.  As  we  expected,  the  central  wavelength  is 
independent  of  the  separation  distance  of  two  waveguides  and 
the  bandwidth  is  inversely  proportional  to  the  coupler  length. 
Because  of  uniform  coupling,  there  is  a  -9-dB  sidelobe, 
which  is  too  high  for  practical  application.  By  using  an  X- 
crossing  structure  [12],  the  sidelobe  can  be  suppressed  to  more 
than  —40  dB,  which  satisfies  the  requirement  of  most  WDM 
systems.  Fig.  4  shows  the  calculated  response  of  an  X-crossing 
fused  vertical  coupler  with  a  crossing  angle  0  =  0.16°  using 
coupled  mode  theory.  One  should  note  that  the  fabrication  of 
an  X-crossing  vertical  coupler  filter  structure  with  separated 
inputs  and  outputs  is  very  easy  with  the  use  of  wafer  fusion 
technology. 

We  have  fabricated  a  fused  straight  (0  -  0°)  vertical 
coupler  filter  based  on  MBE  grown  GaAs  and  MOCVD 
grown  InP  waveguide.  The  structure  is  shown  in  Fig.  1(b) 
and  the  separation  of  the  two  waveguides  is  1.2  /tm,  which 
corresponds  to  1-mra  coupling  length.  A  3  mm  long  device 
has  been  measured.  Fig.  5  shows  the  measured  response. 
The  3-dB  bandwidth  is  1.2  nm,  which  agrees  very  well 
with  theoretical  value  of  1.3  nm.  The  measured  coupling 
efficiency  (the  optical  power  from  the  output  waveguide 
divided  by  the  sum  of  the  optical  power  from  both  the 
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Fig.  5.  The  measured  response  of  a  5-//m-wide  fused  vertical  coupler. 

output  and  input  waveguide)  of  the  current  device  is  about 
50%.  The  polarization  dependent  wavelength  shift  is  only 
5  nm.  Theoretical  calculations  predict  a  7 -run  shift,  and 
as  shown  in  Fig.  3,  this  polarization  dependence  can  be 
eliminated  with  proper  design.  To  optimize  the  design  of 
fused  InP-GaAs  coupler,  one  should  take  into  account  material 
losses.  The  bandgap  of  InGaAsP  guiding  layer  is  closer 
than  GaAs-AlGaAs  material  to  the  operation  wavelength  at 
1.55  ptm.  One  thus  expects  larger  band-to-band  absorption  in 
InGaAsP  waveguides.  However  in  doped  materials,  free  carrier 
absorption  increases  and  dominates  when  the  wavelength  is 
farther  from  the  bandgap.  We  have  measured  4— 6-dB/cm  opti¬ 
cal  propagation  loss  for  both  InP-InGaAsP  and  GaAs— AlGaAs 
waveguides  at  1.55  /xm.  We  did  not  notice  substantial  increase 
in  loss  for  the  lower  bandgap  material  InGaAsP.  The  fusion 
process  adds  1-3-dB/cm  loss  [10]. 

In  conclusion,  a  novel  fused  vertical  coupler  filter  has  been 
demonstrated.  Due  to  its  inherent  polarization-independence 
and  narrowband  that  comes  from  a  large  material  dispersion 


difference  between  InGaAsP-InP  and  AlGaAs-GaAs  wave¬ 
guide,  this  kind  of  fused  filters  will  be  very  promising  in  WDM 
systems.  Furthermore,  these  fused  waveguide  structures  can 
be  used  to  realize  tunable  wavelength  lasers  and  wavelength 
selective  detectors. 
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Abstract — Electroabsorption  modulators  with  traveling- wave 
electrodes  have  been  designed  and  fabricated  using  MOCVD 
grown  InGaAsP-InGaAsP  quantum  wells.  A  modulation  band¬ 
width  of  25  GHz  is  achieved  for  a  2-// m-wide  300-//m-long  device. 
Driving  voltage  of  120  V  is  achieved  for  an  extinction  ratio  of  20 
dB  for  operation  at  1.55  //m. 

Index  Terms — Electroabsorption,  optical  fiber  communication, 
optical  modulation,  quantum-confined  Stark  effect,  quantum-well 
devices,  traveling-wave  devices. 


I.  Introduction 

ELECTROABSORPTION  (EA)  modulators  axe  very  at¬ 
tractive  devices  for  optical  fiber  communications  because 
of  their  very  low  driving  voltage,  very  high  modulation 
efficiency  and  integratibility  with  lasers  [1],  However,  con¬ 
ventional  EA  modulators  are  lumped-electrode  devices,  whose 
speeds  are  limited  by  the  total  parasitics  of  the  devices, 
which  restricts  the  devices  to  very  short  length  for  high-speed 
operation  [2]— [4],  One  way  to  overcome  the  parasitic  limitation 
is  by  integrating  passive  waveguides  with  a  short  absorption 
section  [2]-[3],  However,  this  kind  of  device  with  short  active 
regions  has  lower  extinction  ratio  and  requires  relatively  higher 
driving  voltage.  On  the  other  hand,  a  traveling  wave  electrode 
structure  can  overcome  the  RC  limitation  and  allows  for  longer 
devices  for  the  same  speed  requirement,  which  potentially 
lowers  the  required  driving  voltage,  increases  the  extinction 
ratio  and  the  optical  saturation  power  [5]— [7]. 

In  this  letter,  we  demonstrate  1.55-^m  traveling-wave 
EA  modulators  (TEAM)  fabricated  with  MOCVD  grown 
InGaAsP-InGaAsP  quantum  wells.  We  also  examine  the  effect 
of  load  termination  on  the  high-speed  performance. 

II.  Design  and  Fabrication 

A  coplanar  waveguide  (CPW)  electrode  structure  is  de¬ 
signed  for  both  the  microwave  feed  line  region  and  the  optical 
waveguide  region  [7]-[8].  Fig.  1  shows  the  SEM  picture  of  a 
300-/xm-long  device.  The  feed  line  is  500  /;m  long  on  each  side 
of  the  optical  waveguide.  The  ridge  is  formed  by  CH4/H2/At 
reactive  ion  etch  (RIE)  followed  by  a  HC1:H3P04  wet  etch  to 
reduce  RIE  damage.  PMGI  is  used  to  passivate  the  sidewalls 
and  to  reduce  the  capacitance  due  to  the  p-electrode  on  the 
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Fig.  1.  SEM  picture  of  the  top  view  of  a  300-;/ m  traveling-wave  EA 
modulator. 


Fig.  2.  SEM  facet  view  of  a  cleaved  modulator  with  a  ridge  width  of  2  //m. 

optical  waveguide.  As  shown  in  the  figure,  PMGI  bridges 
are  formed  on  both  ends  of  the  optical  waveguide  to  connect 
grounds  from  different  sides  of  the  optical  waveguide. 

The  materia]  structure  consists  of  0.5-^tm  n+-InP  bot¬ 
tom  contact  layer,  0.3-/iin  n-InP  cladding  layer,  ten  strain- 
compensated  InGaAsP  quantum  well  (10.4  nm,  0.37%  tensile 
strain)  and  InGaAsP  bamer  (7.6  nm,  0.5%  compressive  strain), 
1.5-^m  p-InP  cladding  layer  and  0.1-/im  p+-InGaAs  top 
contact  layer  on  the  semi-insulating  InP  substrate.  The  material 
has  a  room  temperature  photoluminescence  (PL)  peak  at  1495 
nm.  Fig.  2  shows  the  SEM  facet  view  of  a  cleaved  device 
with  a  width  of  2  /xm. 

III.  Experimental  Results 

A  lens  pair  is  used  to  couple  light  from  fiber  to  the 
waveguide  and  vice  versa.  High-frequency  measurements  are 
performed  with  an  HP  Lightwave  Component  Network  An¬ 
alyzer.  Two  Cascade  probes  are  used  for  the  microwave 
connection  and  one  of  the  probes  is  terminated  with  a  50-fl 
load. 
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Reverse  bias  voltage  (V) 


Fig.  3.  Fiber-to-fiber  transmission  versus  reverse  bias  voltage  for  several 
wavelengths  and  for  TE  and  TM  polarization  states.  Solid:  TE  polarization, 
Dashed:  TM  polarization. 


Fig.  4.  Measured  and  simulated  EO  responses  (in  dBe)  for  a  2-//m  wide 
300-// m-long  traveling-wave  modulator.  Curves:  Theory.  Dots:  Measurement. 

Fig.  3  shows  the  fiber-to-fiber  transmission,  versus  reverse 
bias  voltage  for  several  wavelengths  and  for  TE  and  TM 
polarization  states.  The  device  shows  little  dependence  on  the 
polarization,  which  is  a  result  of  proper  bandgap  engineering. 
The  2-^m-wide  300-^m-long  device  has  driving  voltages  of 
1.20  and  1.28  V  for  20  dB  extinction  ratio  for  TE  and  TM 
modes  respectively  at  wavelength  of  1.55  /urn.  A  fiber-to- 
fiber  insertion  loss  of  13.6  dB  is  achieved  at  1.57  /Ltm  for 
TM  polarization  without  antireflection  coating. 

An  equivalent  circuit  model  is  used  to  simulate  the  electrical 
to  optical  response  perfoimance  [8].  Three  different  cases  are 
compared:  1)  no  termination,  2)  with  50-ft  termination,  and 
3)  with  32-ft  termination.  The  calculated  EO  responses  are 
shown  as  lines  in  Fig.  4. 

The  frequency  responses  at  three  different  terminations  are 
measured  and  shown  as  dots  in  Fig.  4.  The  32-ft  termination 
is  achieved  by  wire  bonding  a  thin-film  resistor  to  the  device. 
With  a  50-ft  termination,  a  bandwidth  of  18  GHz  is  obtained, 
yielding  a  bandwidth  to  driving  voltage  (0.8  V  for  10-dB 
extinction)  ratio  of  22.5  GHz/V.  The  bandwidth  reduces  to 
10.7  GHz  when  there  is  no  termination.  The  reason  for  this 
large  bandwidth  reduction  is  the  large  reflection  from  the  un¬ 
terminated  port.  The  capacitance  and  the  series  resistance  of 
the  device  deduced  from  the  electrical  Sn  measurement  are 
0.40  pF  and  4.6  ft,  which  infers  RC  limited  bandwidths  of  7.3 


and  13.4  GHz  for  cases  with  and  without  a  50-ft  termination. 
The  larger  bandwidth  for  the  TEAM  than  that  for  the  lumped 
device  confirms  that  traveling-wave  electrode  structure  can 
overcome  the  RC  limitation. 

The  bandwidth  for  the  case  with  a  32-0  termination  in¬ 
creases  to  about  24.5  GHz,  yielding  a  bandwidth  to  driving 
voltage  (0.8  V  for  10-dB  extinction)  ratio  of  30.6  GHz/V. 

IV.  Discussion 

The  key  feature  on  the  response  curve  with  32-0  termi¬ 
nation  is  the  resonance  peak,  which  is  mainly  caused  by 
the  microwave  reflections  at  the  load  and  the  source  ends 
and  at  the  interfaces  between  the  optical  waveguide  and  the 
feed  lines.  Because  of  the  negative  reflection  at  the  load,  the 
response  at  some  frequencies  is  enhanced.  This  can  be  seen  in 
Fig.  5,  which  shows  the  simulated  optical  signal  output  under 
small  signal  modulation.  Here,  a  Gaussian-shaped  pulse  with 
T fwhm  =  10  %/IrT  2  ps  and  an  amplitude  of  0.07  V  is  used 
as  the  modulation  signal  that  superimposes  on  a  dc  bias  level 
of  1.2  V.  Without  any  load  termination,  the  output  optical 
pulse  profile  shows  multiple  reflection  structures  that  come 
from  the  interfaces  between  the  optical  waveguide  and  the 
feed  line,  and  between  the  feed  lines  and  the  source/load. 
These  reflections  have  the  same  sign  as  the  output  signal 
because  both  the  source  and  the  load  have  higher  impedance 
than  the  waveguide  (characteristic  impedance  ~25  ft)  and  the 
feed  line  (characteristic  impedance  ~42  ft).  The  reflections 
become  much  weaker  when  a  50-ft  termination  is  used.  The 
reflection  changes  the  sign  when  the  load  impedance  is  reduced 
to  32  ft  and  cancels  the  reflection  at  the  connections  between 
the  optical  waveguide  and  the  feed  lines.  This  cancellation  in 
reflection  causes  a  frequency  response  enhancement  at  high 
frequencies.  This  is  another  proof  that  the  device  is  operating 
in  a  traveling- wave  mode.  The  inductance  due  to  the  ribbon  is 
about  0.07  nH  and  is  not  the  main  reason  for  this  resonance  as 
verified  by  simulation.  The  use  of  a  32-ft  load  will  increase  the 
required  drive  voltage,  which  is  about  1.2  times  that  required 
for  a  load  of  50  ft.  This  could  be  overcome  by  the  longer 
device  length  than  that  of  conventional  EA  modulators. 

To  separate  the  microwave  propagation  loss  from  the 
impedance-mismatch  induced  reflection  loss,  the  microwave 
transmission  (S21)  for  devices  with  lengths  of  300,  400, 
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500,  and  600  ptm  was  measured.  It  is  found  that  at  20  GHz, 
the  600-fj.m  devices  have  about  2.0-dB  extra  loss  than  300- 
/.im  devices.  This  suggests  that  for  the  300-jitm  device,  the 
propagation  loss  at  20  GHz  is  about  2.0  dB,  while  the  sum 
of  the  reflection  loss  and  the  other  excess  loss  is  about  1.0 
dB  since  the  total  microwave  loss  is  3.0  dB.  Consequently, 
to  improve  the  device  bandwidth,  we  need  to  reduce  the 
propagation  loss  and  increase  the  waveguide  characteristic 
impedance  to  reduce  the  reflection  loss. 

The  improvement  in  bandwidth  with  terminating  the  trans¬ 
mission  line  is  larger  for  longer  devices,  as  expected  for 
a  traveling-wave  modulator.  However,  the  propagation  loss 
(about  0.67  dB/100  //m  at  20  GHz)  is  the  dominant  factor  that 
limits  the  bandwidth  in  longer  devices. 

Long  distance  digital  transmission  has  been  demonstrated 
at  10  and  30  Gb/s  with  the  300-/zm  traveling- wave  modulator 
and  will  be  presented  in  the  future. 


V.  Conclusion 

We  have  demonstrated  traveling-wave  electroabsorption 
modulators  using  the  InGaAsP-InGaAsP-InP  material  system. 
For  a  300-^m-long  device,  a  modulation  bandwidth  of  25 
GHz  is  achieved.  The  devices  show  little  dependence  on 
the  polarization  state.  Driving  voltages  of  1.20  and  1.28 
V  are  achieved  for  an  extinction  ratio  of  20  dB  for  TE 
and  TM  modes.  A  resonance  effect  is  observed  for  devices 
terminated  with  32-tt  loads.  This  resonance  effect  is  the  result 
of  microwave  reflections  at  the  source  and  the  load  ends. 
These  results  show  that  with  good  design,  traveling-wave  EA 
modulators  can  overcome  the  RC  limitation  and  obtain  higher 


speed,  lower  driving  voltage  and  larger  extinction  ratio  for 
optical  fiber  communication  applications. 
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Summary:  We  have  proposed  a  novel  scheme  for  in-line  optical 
regeneration  in  RZ  transmission  systems.  The  scheme  is  based  on 
the  principle  of  black  box  RZ/soliton  conversion,  synchronous 
modulation  and  filtering,  followed  by  reverse  soliton/RZ  conver¬ 
sion.  Potential  applications  range  from  SMF-based  terrestrial  sys¬ 
tems  to  dispersion-managed  soliton  systems  for  global-distance 
transmission. 
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Fused  lll-V  vertical  coupler  filter  with 
reduced  polarisation  sensitivity 

B.  Liu,  A.  Shakouri,  P.  Abraham,  Y.J.  Chiu  and 

J.E.  Bowers 

A  novel  fused  InGaAsP/InP-AlGaAs'GaAs  vertical  coupler  filter 
is  demonstrated.  By  using  the  large  material  dispersion  between 
InGaAsP  and  AJGaAs  and  similar  waveguide  geometries,  a  3.6  nm 
bandwidth  and  5nm  polarisation  dependent  wavelength  shift 
using  a  1.1mm  long  fused  vertical  coupler  has  been  achieved. 

Introduction:  Dense  wavelength  division  multiplexing  (DWDM) 
requires  narrowband  optical  filters  and  add/drop  channel  multi¬ 
plexer/demultiplexers.  Asymmetrical  vertical  directional  coupler 
filters  [1-4]  using  two  dissimilar  waveguides  on  III-V  semicon¬ 
ductors  are  promising  because  of  their  inherent  on-  chip  integra¬ 
tion  with  other  optoelectronic  devices  and  ease  of  fabrication.  In 
conventional  vertical  coupler  filters,  both  waveguides  are  fabri¬ 
cated  using  the  same  material  system,  InGaAsP/InP  [2  -  4]  or 
AlGaAs/GaAs.  A  narrow  bandwidth  requires  a  strong  waveguide 
dispersion  difference  between  two  waveguides.  This  is  usually 
achieved  using  dissimilar  waveguide  geometries.  One  of  the 
waveguides  is  narrow  with  high  index  contrast.  And  the  other 
wide  with  low  index  contrast.  There  are  several  obstacles  to  the 
use  those  vertical  coupler  filters,  including  a  strong  polarisation 
dependence  (the  centre  wavelength  shift  for  TE  and  TM  modes  is 
>  60 nm),  a  large  difference  in  coupling  efficiency  between  the  two 
dissimilar  waveguides,  and  the  difficulty  of  launching  the  light  into 
and  coupling  light  out  of  two  very  close  waveguides.  Since  the 
polarisation  dependence  and  different  coupling  efficiency  come 
from  the  strong  asymmetry  of  two  waveguide  geometries,  these 
problems  can  be  solved  if  the  two  waveguides  have  similar  struc¬ 
tures  and  indices,  but  they  have  a  large  material  dispersion  differ¬ 
ence. 

In  this  Letter,  we  report  a  novel  vertical  coupler  filter  (VCF) 
based  on  wafer  fusion  technology  [5,  6],  which  combines  two  dif¬ 
ferent  material  systems:  InGaAsP/InP  and  AIGaAs/GaAs.  By  uti¬ 
lising  the  large  material  dispersion  difference  between  InGaAsP 


and  AlGaAs  around  1.55  pm  wavelength  to  realise  narrowband  fil¬ 
ters  using  similar  waveguide  geometry  which  ensures  that  both 
waveguides  have  almost  identical  waveguide  dispersions  (same 
birefringence),  it  is  possible  to  considerably  reduce  the  polarisation 
sensitivity  of  the  vertical  coupler  filter.  Furthermore,  wafer  fusion 
can  facilitate  the  separation  of  the  waveguides  at  the  input  and  at 
the  output  [7,  8]. 


fusion 

interface 


2|un  inP 
0.4um  InGaAsP 
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Fig.  1  SEM  photograph  of  fused  In  Ga  As  P/In  P-A  IGaA  s/GaAs  vertical 
coupler  filter  and  near  field  of  fused  coupler  filter 


a  SEM  photograph 
b  Near  field 


Fabrication:  The  structure  of  the  fused  vertical  coupler  filter  is 
shown  in  Fig.  1.  The  upper  InGaAsP/InP  waveguide  consists  of  a 
0.4pm  InGaAsP  (Kap  =  1.45 pm)  guiding  layer  and  a  1.2pm  InP 
cladding  layer,  which  was  grown  on  an  InP  substrate  by  metal 
organic  chemical  vapour  deposition  (MOCVD).  The  lower 
AIGaAs/GaAs  waveguide,  grown  on  a  GaAs  substrate  by  MBE, 
includes  a  3pm  Al^Ga^As  isolation  layer,  0.53pm  Al^Ga^yA; 
core  layer,  0.2pm  Alo.5Gao.5As  cladding  layer  and  200 nm  GaAs 
cap  layer  that  serves  to  protect  the  AlGaAs  layer  from  oxidation. 
The  two  waveguides  are  phase  matched  around  1.55  pm.  3D  BPM 
calculations  predict  a  1  mm  coupling  length.  To  fabricate  the  fused 
vertical  coupler  filter,  rib  waveguides  with  a  ridge  height  of  1.2pm 
and  3-6 pm  widths  on  an  InP  sample  are  formed  using  selective 
wet  etching.  After  the  standard  cleaning  procedure,  the  InP  sam¬ 
ple  with  rib  waveguides  and  the  GaAs  sample  are  fused  together 
at  a  temperature  of  630°C  in  a  hydrogen  atmosphere  for  50min. 
Subsequently  the  InP  substrate  is  removed  by  HC1  etching.  Fig.  1 
shows  the  SEM  photograph  of  the  finished  device. 


Fig.  2  Measured  response  of  3pm  wide  fused  vertical  coupler  filter  for 
TE  and  TM  modes 
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Results  and  discussions :  The  optical  transmission  of  fused  vertical 
coupler  filters  is  measured  with  polarised  light  from  a  tunable 
semiconductor  laser  that  is  coupled  to  the  filters  via  a  singlemode 
lensed  fibre.  The  light  at  the  output  of  the  filters  is  coupled  to 
another  singlemode  lensed  fibre  connected  to  a  detector.  Fig.  1  h 
shows  the  near  field  image  with  a  x80  objective  at  the  output  of 
the  fused  vertical  coupler  filter.  The  mode  profiles  of  the  two 
waveguides  are  very  similar,  which  is  important  for  realising 
polarisation  independence  and  similar  coupling  efficiency.  Fig.  2 
shows  the  transmission  data  for  TE  and  TM  polarised  light  for  a 
3pm  wide  1.1mm  long  fused  filter.  The  3dB  bandwidths  of  the  TE 
and  TM  modes  are  3.6  and  3.3 nm,  which  closely  agree  with  the 
theoretical  values  of  3.8  and  3.5nm.  The  polarisation  dependent 
wavelength  shift  is  only  5nm.  Theoretical  calculations  predict  a 
7nm  shift  and  this  polarisation  dependence  can  be  eliminated  by  a 
slight  modification  to  the  design  by  replacing  the  1.45  pm 
InGaAsP  with  1. 37pm  InGaAsP.  Since  the  material  dispersion  is 
increased  when  the  wavelength  is  near  the  bandgap,  the  3dB  band¬ 
width  is  narrower  for  shorter  wavelength.  Fig.  3  shows  the 
dependence  of  bandwidth  on  wavelength,  which  is  agreement  with 
the  theoretical  calculations. 


Fig.  3  Wavelength  dependence  of  filter  bandwidth 


Summary:  A  novel  fused  InGaAsP/lnP-AJGaAs/GaAs  vertical 
coupler  filter  with  reduced  polarisation  dependence  has  been  fabri¬ 
cated  and  characterised.  The  large  material  dispersion  difference 
between  InGaAsP  and  AJGaAs  was  used  to  achieve  narrowband 
filters.  A  polarisation  dependent  wavelength  shift  of  5nm  has  been 
achieved.  The  corresponding  wavelength  shift  in  conventional  ver¬ 
tical  coupler  filters  is  >  60  nm.  With  an  improved  design,  polarisa¬ 
tion-independent  and  narrowband  fused  vertical  coupler  filters  can 
be  realised.  These  filters  are  promising  components  for  future 
WDM  systems. 
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Increased  speed  of  semiconductor  optical 
amplifier  wavelength  converter  at  multiple 
channels  using  Fabry-Perot  and 
microelectromechanical  filters 

D.  Mahgerefteh,  P.S.  Cho  and  P.  Tayebati 

A  Fabry-Perot  optical  discriminator  is  used  to  reduce  the  pattern 
dependence  of  a  semiconductor  optical  amplifier  wavelength 
converter  at  10 Gbit/s.  In  conjunction  with  a  voltage  tunable 
microelectromechanical  bandpass  filter,  this  allows  a  reduction  of 
conversion  penalty  for  multiple  wavelength  channels. 

Semiconductor  optical  amplifier  based  all-optical  wavelength  con¬ 
veners  are  being  considered  for  used  in  reconfigurable  nodes  of 
future  WDM  systems  because  of  their  compactness,  high-speed 
operation  and  low  power  requirements  [1],  To  be  cascadable,  sem¬ 
iconductor  optical  amplifiers  (SOAs)  have  to  have  a  modulation 
bandwidth  that  exceeds  the  bit  rate  of  operation.  In  addition  the 
wavelength  converter  should  be  able  to  convert  an  incoming  data 
lo  one  of  many  wavelength  channels  as  specified  by  a  circuit  con¬ 
trol  signal. 

High-speed  operation  can  be  obtained  by  using  long  semicon¬ 
ductor  optical  amplifiers,  high  currents,  and  high  optical  powers. 
Bit  rates  as  high  as  40  Gbit/s  have  been  demonstrated  in  SOAs  [2]. 
It  was  recently  demonstrated  that  a  fibre  Bragg  grating  optical  dis¬ 
criminator  placed  after  the  converter  can  also  increase  the 
dynamic  bandwidth  and  therefore  the  bit  rate  of  operation  of  an 
SOA  wavelength  converter  [3].  As  described  in  [3]  the  discrimina¬ 
tor  converts  the  phase  modulation  that  accompanies  the  converted 
signal  to  amplitude  modulation  to  sharpen  the  transition  edges 
between  the  1  and  Obits.  This  requires  that  the  wavelength  of  the 
converted  data  match  the  transmission  edge  of  the  fibre  grating 
such  that  the  converted  spectrum  lies  in  the  linear  transmission 
region  of  the  grating,  making  it  difficult  to  tune  to  different  chan¬ 
nels. 

Here  we  use  a  Fabiy-Perot  filter  with  a  free-spectral  range  of 
100GHz  (ITU  standard)  as  the  optical  discriminator.  Hence  the 
relative  spectral  position  of  the  signal  and  Fabiy-Perot  transmis¬ 
sion  spectrum  remains  fixed  as  the  wavelength  of  the  converted 
data  is  changed  from  one  channel  on  the  ITU  grid  to  the  other.  In 
addition  we  use  a  voltage  tunable  microelectromechanical  filter  for 
channel  selection  [4]. 

Fig.  1  shows  a  schematic  diagram  of  the  conversion  experiment. 
IOGbit/s  NRZ  data  and  a  CW  signal  are  launched  into  an  SOA  in 
a  co-propagating  configuration.  The  SOA  is  a  fibre-pigtailed 
amplifier  with  an  unsaturated  gain  of  26dB  at  200mA.  The  bias  of 
the  SOA  was  145mA.  The  optical  power  of  the  data  and  CW 
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optical  amplifier  (SOA).  The  phase  conjugated  signal  is  produced 
by  four-wave  mixing.  The  wavelength  shift  in  the  SOA-OPC  is 
-lOnm  The  converted  signal  is  filtered  out  in  two  stages  of  3nrn 
filter  and  amplified  by  an  EDFA.  The  measured  conversion  effi¬ 
ciency  is  -20.5dB  [4].  In  this  system,  it  is  assumed  that  the  condi¬ 
tion  (3 20  Z\  =  P? >  ^2  ^  satisfied,  where  the  amounts  of  total 
dispersion  in  both  sections  are  the  same.  The  lengths  of  the  trans¬ 
mission  fibres  in  each  section  are  zx  =  51.0km  and  z2  =  53.5  km. 
The  dispersion  values  are  =  -21.5ps2/km  (at  1550. 5  nm)  and 
P(22>  =  -20.5ps2/km  (at  1540.5nm).  The  input  powers  P*  at  the 
position  indicated  in  Fig.  li  are  set  to  be  equal.  The  signal  is 
detected  with  a  50GHz  pin  photodetector.  The  electrical  signal  is 
mixed  down  with  the  LO,  demodulated,  and  directed  to  the  BER 
tester. 
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Fig.  2  Measured  optical  spectra  at  midway  OPC  and  at  output ,  when 
input  power  Pm  is  5  dBm,  and  measured  optical  spectra  at  midway  OPC 
and  at  output,  when  input  power  P ^  is  15dBm 


a  Midway  OPC,  P =  5dBm 
b  Output,  Pm  ~  5 dBm 
c  Midway  OPC,  P„  -  15dBM 
d  Output,  P*  =  15  dBm 


however,  the  power  change  is  not  symmetric  with  respect  to  the 
OPC  position,  which  results  in  distorted  optical  spectra.  The 
degree  of  imperfect  compensation  becomes  larger  as  the  input 
power  increases.  Fig.  2d  shows  the  output  spectra  when  P„  is 
15  dBm.  The  distorted  spectrum  caused  by  the  modulational  insta¬ 
bility  is  also  observed  despite  the  use  of  the  midway  OPC.  Com¬ 
pared  with  the  case  in  which  the  midway  OPC  is  not  inserted, 
where  the  power  of  the  sideband  is  only  -5.65 dB  below  the  car¬ 
rier,  the  distortion  caused  by  GVD  and  SPM  is  partially  compen¬ 
sated  for  by  the  midway  OPC.  The  measured  BERs  are  plotted  in 
Fig.  3.  No  error  floor  is  observed  when  the  input  power  is 
<  5.0dBm.  With  increasing  input  power,  the  error  floor  is  clearly 
observed.  The  error  floor  is  produced  by  the  asymmetric  power 
change.  A  comparison  of  Lnl  with  the  transmission  lengths  z{  and 
z2  shows  that  the  OPC  must  be  inserted  at  the  position  where  the 
transmission  length  is  much  less  than  the  nonlinear  length. 

Conclusion:  We  have  observed  the  limitation  of  the  midway  OPC 
system  applied  for  60GHz  SCM  optical  DSB  signal  transmission. 
The  BER  degradation  was  caused  by  the  asymmetry  of  the  power 
change  in  the  sections  of  fibre  on  either  side  of  the  OPC.  These 
results  indicate  that  the  transmission  length  before  and  after  the 
OPC  must  be  much  less  than  the  nonlinear  length. 
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Fig.  3  Measured  BER  against  received  optical  power 

□  Pin  =  5  dBm 
A  Pin  =  lOdBm 
OPin  =  15  dBm 
#  back-to-back 

z,  =  51km  (at  1550.5nm),  z2  -  53.5km  (at  1540.5nm) 
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Fig.  2 a  and  b,  respectively,  show  the  spectra  at  the  midway 
OPC  and  at  the  output  of  the  system  when  the  input  power  Pm  = 
5 dBm.  The  nonlinear  length  Lnl,  as  defined  by  \tyPar>  is  calculated 
to  be  306km.  In  this  case,  the  transmission  lengths  2,  and  z2  are 
much  shorter  than  Lnl.  No  change  in  the  optical  spectrum  is 
observed  at  both  positions.  Fig.  2c  shows  the  optical  spectrum  at 
the  midway  OPC  when  P *  =  15dBm.  In  this  case,  Lsl  is  30.6km 
and  the  transmission  lengths  zx  and  z2  are  longer  than  Lsl.  The 
modulation  depth,  which  can  be  observed  by  the  power  ratio  of 
the  sideband  to  the  carrier  light,  becomes  higher  after  the  propa¬ 
gation  compared  with  the  case  of  P„  —  5  dBm.  This  is  because  of 
the  modulational  instability  [5,  6],  which  is  induced  by  the  small 
modulation  of  the  60GHz  millimetre-wave  data  signal.  Four-wave 
mixing  in  the  anomalous  dispersion  regime  along  with  SPM  causes 
modulational  instability.  In  an  ideal  OPC  system,  the  modula¬ 
tional  instability  that  occurs  in  the  first  section  can  be  compen¬ 
sated  for  in  the  second  section.  In  a  practical  OPC  system, 


High-speed  operation  of  travelling-wave 
electroabsorption  modulator 

V.  Kaman,  S.Z.  Zhang,  A.J.  Keating  and  J.E.  Bowers 

The  authors  describe  the  high-speed  operation  of  an 
electroabsorption  modulator  with  travelling-wave  electrodes.  The 
modulator  exhibited  good  performance  over  a  broad  bandwidth 
in  10  and  30Gbit/s  transmission  experiments.  The  polarisation- 
insensitive  electroabsorption  modulator  required  a  low  switching 
voltage  of  ~  1.6Vp.p. 

Introduction:  The  speed  of  optical  fibre  telecommunication  systems 
based  on  electrical  time  division  multiplexing  has  increased  rapidly 
over  the  last  decade  enabling  40Gbit/s  throughput  for  a  single  car¬ 
rier  [i,  2].  Electroabsorption  (EA)  modulators  with  very  low  driv¬ 
ing  voltage,  high  modulation  efficiency,  polarisation  insensitivity 
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and  integrability  with  lasers  [3  -  6],  make  them  very  attractive 
devices  for  optical  communications  systems.  The  bandwidth  of  EA 
modulators  based  on  a  lumped-electrode  structure  is  limited  by  the 
total  capacitance;  short  lengths  result  in  high  bandwidths  while 
exhibit  better  extinction  ratios,  necessitating  a 
tradeoff  between  bandwidth  and  extinction  ratio.  Recently,  it  has 
been  shown  that  longer  devices  with  potentially  lower  drive  volt¬ 
ages  and  increased  extinction  ratio  can  be  achieved  using  a  travel- 
hng-wave  electrode  structure  that  overcomes  the  RC  limitation 
[7,  8J. 

We  have  recently  demonstrated  a  polarisation-insensitive  travel- 
hng-wave  EA  modulator  operating  at  1.55  pm,  with  a  bandwidth 
^  a  20dB  extinction  ratio  (DC)  voltage  of  1.20  and 
1. 28V  for  the  TE  and  TM  modes,  respectively  [9].  In  this  Letter 
we  report  on  the  10  and  30Gbit/s  operation  of  this  modulator. 


Modulator  characteristics:  The  travelling-wave  EA  modulator  used 
m  this  experiment  is  similar  to  the  device  described  in  [91.  The 
device  was  designed  to  be  polarisation-insensitive  over  a  range  of 
wavelengths.  The  fibre-to-fibre  insertion  loss  was  16dB  at 
1542nm.  A  drive  voltage  of  2V  was  required  for  a  maximum 
extinction  ratio  (DC)  of  30dB.  The  25GH*  bandwidth  was 
achieved  by  terminating  the  travelling-wave  electrode  in  a  low 
resistance  using  a  thin-film  resistor. 


Fig.  1  Bit  error  rate  curves  for  JO  Gbit/s  transmission 


ess 


vnS8oJ!v^v)ed  eye  diagram  af*er  I50km  transmission  (H;  50ps/div, 
A  back-to-back,  TM  mode 
O  back-to-back,  TE  mode 
A  150km  DSF  transmission 


■JOGtaA  transmission:  To  explore  the  higher-speed  characteristics 
or  the  EA  modulator,  transmission  experiments  at  30  Gbit/s  were 
earned  out  using  the  setup  shown  in  Fig.  2.  The  30Gbit/s  NR 2 
electncal  signal  was  generated  by  multiplexing  four  7.5Gbit/s  (27-l 
pattern  length)  data  streams  delayed  by  10  bits  from  each  other 
using  a  4:1  multiplexer  (MUX)  [10].  The  pattern  length  was  lim¬ 
ited  by  the  receiver  electronics;  however,  excellent  low  frequency 
^rformancc  of  the  EA  modulator  was  confirmed  with  the 
10 Gbit/s  transmission  experiment  described  earlier.  At  the 
receiver,  an  EDFA  was  used  as  an  optical  preamplifier  into  the 
packaged  receiver  with  a  30GHz  bandwidth  [1 1].  Broadband  elec¬ 
trical  amplifiers  followed  the  optical  receiver  and  the  signal  was 
fed  into  a  30Gbit/s  1:4  demultiplexer  (DMUX)  and  a  dock  recov¬ 
ery  circuit  (CRC)  [10]. 


1542nm 


optical 

preamplifier 


Has) 

Fig.  3  Bit  error  rate  for  back-to-back  30  Gbit/s  transmission 


JO  Gbit/s  transmission:  Transmission  experiments  using  the  travel¬ 
ling-wave  EA  modulator  at  lOGbit/s  were  carried  out  (23l-l  pat¬ 
tern  length)  with  commercially  available  components  A  drive 
voltage  of  -  L6V„  was  applied  to  the  EA  modulator  by  high¬ 
speed  probes.  A  DFB  laser  operating  at  1542nm  followed  by  a 
polarisation  controller  was  used  as  the  light  source  into  the  EA 
modulator.  Two  lens  pairs  were  used  to  couple  the  light  into  and 
out  of  the  modulator.  The  output  signal  from  the  EA  modulator 
was  amplified  by  an  erbium-doped  fibre  amplifier  (EDFA)  before 
it  was  launched  into  150km  of  dispersion-shifted  fibre  (DSF)  with 
a  mean  dispersion  parameter  of  -0.93ps/(nm-km).  The  a  parame¬ 
ter  of  the  modulator  was  +1.2  at  the  operating  bias  of  ~1.1V. 
Flgj  \  s^°^s  l*ie  bit  error  rate  (BER)  performance  of  the  EA 
modulator  for  back-to-back  and  >  1 50km  of  DSF.  In  the  back- 
measurcment5>  err°r-free  operation  and  a  sensitivity  of 
-3 1.2 dBm  were  achieved  for  both  the  TM  and  the  TE  polansa- 
tions  without  changing  the  operating  conditions  of  the  modulator. 
This  indicates  that  the  polarisation  dependence  of  the  EA  modula- 
l<f B  low- An  improved  sensitivity  of -32. 8 dBm  was  achieved 
alter  150km  DSF  transmission  due  to  the  interplay  between  the 
positive  chirp  of  the  modulator  and  the  slightly  negative  disper¬ 
sion  of  the  fibre.  The  inset  in  Fig.  1  shows  the  received  eye  dia¬ 
gram  after  transmission  over  150km  of  DSF  fibre.  The  long 
pattern  length  used  in  this  transmission  indicates  the  good  low-fre¬ 
quency  characteristics  for  this  modulator. 
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Inset:  (H:  13 ps/di v,  V:  lOOmV/div) 

(i)  30Gbit/s  output  signal  from  4:1  MUX 
(n)  received  back-to-back  30Gbit/s  signal 

Fig.  3  shows  the  BER  performance  of  the  travelling-wave  EA 
modulator  for  back-to-back  transmission  at  30 Gbit/s.  Error-free 
operation  and  an  optical  sensitivity  of  -20.2dBm  were  achieved 
for  the  best  demultiplexed  channel.  The  inset  in  Fig.  3  shows  (a) 
the  30 Gbit/s  output  of  the  4:1  MUX  and  (b)  the  received  30Gbit/s 
signal  after  back-to-back  transmission.  The  slight  eye  closure  on 
every  other  channel,  we  believe,  is  due  to  an  on-chip  clock-phase 
ah^unent  problem  in  the  electrical  MUX.  Improvement  in  the 
30Gbit/s  optical  sensitivity  should  be  possible  by  improving  the 
receiver  and  the  fibre  coupling  into  and  out  of  the  EA  modulator. 

Summary:  We  have  successfully  demonstrated  10  and  30Gbit/s 
operation  of  a  travelling-wave  EA  modulator.  These  experiments 
confirmed  a  broad  bandwidth  and  a  low  switching  voltage  for 
these  modulators.  Our  results  indicate  that  travelling-wave  EA 
modulators,  with  their  low  driving  voltage  and  polarisation-insen- 
siUvity ,  are  strong  candidates  for  future  high-speed  optical  com¬ 
munication  systems. 
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Increased  input  power  dynamic  range  of 
Mach-Zehnaer  wavelength  converter  using  a 
semiconductor  optical  amplifier  power 
equaliser  with  8dBm  output  saturation 
power 

J.-Y.  Emery,  B.  Lavigne,  C.  Porcheron,  C.  Janz, 

F.  Dorgeuille,  F.  Pommereau,  F.  Gaborit,  I.  Guillemot- 
Neubauer  and  M.  Renaud 

A  two-section  semiconductor  optical  amplifier  is  proposed  and 
investigated  for  use  as  a  power  equaliser,  providing  a  3dB  output 
saturation  power  of  8  ±  ldBm.  Using  the  device  to  control  the 
input  power  of  a  Mach-Zchnder  wavelength  converter,  an 
increase  in  the  input  power  dynamic  range  of  the  converter  from 
2.5  to  >  20dB  is  demonstrated  at  lOGbit/s. 

Introduction :  Semiconductor  optical  amplifiers  (SOAs)  are  promis¬ 
ing  devices  for  optical  system  applications  such  as  amplification, 
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switching  and  wavelength  conversion.  Recently,  a  power  equalisa¬ 
tion  function  has  been  identified  as  important  in  order  to  increase 
the  low  input  power  dynamic  range  of  optical  wavelength  convert¬ 
ers  based  on  interferometric  structures  [1  -  3],  Single  SOA  devices 
can  provide  power  equalisation,  but  the  usable  output  power  is 
limited  because  of  the  dependence  of  the  output  saturation  power 
on  the  driving  current.  As  a  consequence,  the  input  power 
dynamic  range  of  the  equaliser  is  also  limited.  By  cascading  two 
optical  amplifiers  the  capability  of  increasing  the  output  power 
saturation  of  the  power  equaliser  has  been  demonstrated  [2,  4].  In 
such  a  device,  the  input  signal  power  is  controlled  by  the  first 
amplifier  while  the  second  amplifier  maintains  a  sufficient  gain 
and  a  high  output  saturation  power.  By  cascading  an  SOA  and  an 
Er-doped  fibre  amplifier,  the  input  power  dynamic  range  of  an 
interferometric  wavelength  converter  has  previously  been  increased 
from  3  to  28  dB  [1].  Recently,  5  dBm  output  saturation  power  has 
been  demonstrated  in  a  two-section  SOA  based  on  a  multiquan¬ 
tum  well  structure  [4], 

In  this  Letter,  we  report  an  optical  power  equaliser  based  on  a 
two-section  SOA  which  provides  8  dBm  output  saturation  power. 
lOGbit/s  experiments  with  NRZ  signal  have  been  carried  out 
using  the  two-section  SOA  to  control  the  input  power  of  a  Mach- 
Zehnder  interferometer  (MZI)  wavelength  converter.  Control  of 
the  bias  current  in  the  first  section  enables  us  to  increase  the  input 
power  dynamic  range  of  the  MZI  from  2dB  to  >  20 dB  with  a  sen¬ 
sitivity  penalty  of  <  1  dB. 


Fig.  1  Fibre- to- fibre  gain  against  output  power  measured  at  1.55pm  for 
different  bias  currents  in  first  section  of  SOA  power  equaliser 

Inset:  schematic  diagram  of  two  section  SOA  with  passive  waveguide 
in  between 

From  top  to  bottom:  Bias  current  =  200,  100,  80,  60,  50,  40,  30mA 


Device  principle  and  design:  The  power  equaliser  is  based  on  a 
two-section  SOA  (inset  Fig.  1).  In  such  a  device,  the  bias  current 
of  the  first  section  allows  us  to  control  the  optical  power  entering 
into  the  second  section.  In  the  second  section,  a  high  current  is 
maintained  in  order  to  provide  a  sufficient  gain  and  a  high  output 
power.  To  accept  a  high  input  signal  power,  the  input  SOA  has  to 
work  in  a  near-transparent  regime.  In  this  regime  the  noise  figure 
increases  rapidly,  leading  to  a  reduction  in  the  output  optical  sig- 
nal-to-noise  ratio.  The  noise  figure  can  be  limited  by  either 
decreasing  the  cavity  length  of  the  SOA  or  by  decreasing  the  opti¬ 
cal  confinement  factor  of  the  active  layer.  The  device  shown  in 
Fig.  1  is  realised  by  a  combination  of  these  two  approaches.  The 
polarisation-insensitive  SOA  structure  is  based  on  a  low-tensile 
strained  bulk  separate  confinement  heterostructure  [5]  with  an 
active  layer  thickness  of  0.12pm,  giving  an  optical  confinement 
factor  of  -0.25.  The  active  buried  ridge  stripe  is  800pm  in  total 
length  including  150pm  long  tapered  sections  at  both  ends.  A  dou¬ 
ble  core  taper  consisting  of  the  tapered  active  region  and  an 
underlying  passive  waveguide  enables  us  to  couple  the  two  SOAs. 
The  passive  section,  in  between  the  two  SOAs,  provides  electrical 
insulation  of  the  electrodes  and  an  attenuation  function.  This 
attenuation  due  to  propagation  losses  (5-7 dB)  limits  the  gain  sat¬ 
uration  of  the  second  amplifier  by  the  amplified  spontaneous 
emission  emitted  from  the  first  SOA  (and  reciprocally).  The  over¬ 
all  device  length  is  2.4mm  including  passive  waveguides  at  each 
end  which  provide  typical  full  beam  divergences  of  13°  x  15°  [6]. 
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Push-Pull  Fused  Vertical  Coupler  Switch 

Bin  Liu,  Ali  Shakouri,  Patrick  Abraham,  and  John  E.  Bowers,  Fellow ,  IEEE 


Abstract — A  single-electrode  push-pull  fused  vertical  coupler 
(FVC)  switch  using  macroscopic  crystal  inversion  symmetry  is 
demonstrated.  The  anisotropic  linear  electrooptic  effect  in  a  zinc- 
blende  crystal  is  used  to  achieve  optical  switching  under  12-V 
reverse  bias  for  a  6.9-mm-long  FVC  whose  two  guiding  layers 
have  different  crystal  symmetries.  No  switching  is  observed  for 
the  identical  structure  in  which  the  two  guiding  layers  have  the 
same  crystal  symmetry. 

Index  Terms — Electrooptic  effects,  optical  directional  couplers, 
optical  switches,  water  bonding. 

VERTICAL  directional  couplers  [1]— [3]  are  very  attractive 
candidates  to  make  optical  waveguide  switches  and  nar¬ 
rowband  filters  because  of  their  very  short  coupling  length 
and  the  feasibility  of  integration  with  other  optoelectronic 
devices.  Generally,  switching  is  achieved  by  introducing  a 
phase  mismatch  (A (3)  between  two  waveguides  through  the 
electrooptic  effect.  In  a  push-pull  operation,  the  introduction 
of  positive  and  negative  phase  shifts  in  the  two  waveguides 
of  the  coupler  reduces  the  driving  voltage  and  chirping  for 
switching.  It  is  well  known  that  the  linear  electrooptic  effect 
is  anisotropic  [4],  [5]  in  zinc-blende  ciystal  structures.  When 
the  applied  electric  field  is  perpendicular  to  the  (001)  surface,  it 
gives  a  positive  index  change  +A n  for  the  TE-polarized  light 
propagating  along  [110]  direction  and  a  negative  index  change 
-An  for  the  light  propagating  along  the  [110]  direction.  In 
conventional  epitaxial  vertical  couplers,  the  upper  and  lower 
waveguides  have  the  same  crystal  orientation  [see  Fig.  1(b)]. 
Consequently  push-pull  operation  requires  the  signs  of  the 
electric  fields  in  the  upper  and  lower  waveguides  to  be 
opposite.  This  requires  presence  of  a  third  electrode  between 
waveguides  and  application  of  positive  and  negative  biases  to 
the  upper  and  lower  waveguides.  The  fabrication  is  difficult  in 
conventional  vertical  couplers.  Since  the  electrooptic  effect  is 
anisotropic,  if  one  of  the  waveguides  in  the  vertical  couplers 
is  along  the  [110]  orientation  and  the  other  one  along  the 
[110]  orientation  [see  Fig.  1(a)],  then  under  an  applied  bias, 
the  index  changes  in  two  waveguides  have  different  signs.  This 
simplifies  the  electrode  fabrication  and  only  requires  one  elec¬ 
trode  for  push-pull  operation.  In  this  letter,  a  vertical  coupler 
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Fig.  1.  SEM  pictures  and  the  crystal  orientations  of  (a)  antiphase  FVC  and 
(b)  in-phase  FVC. 

switch  with  macroscopic  inversion  symmetry  is  demonstrated 
that  uses  the  wafer  fusion  technique. 

Wafer  fusion  [6],  [7]  is  a  powerful  technique  to  fab¬ 
ricate  optoelectronic  devices  that  cannot  be  realized  using 
conventional  epitaxial  growth  and  processing.  In  addition 
to  the  inherent  advantage  of  combining  two  materials  with 
a  large  lattice  mismatch,  wafer  fusion  has  another  unique 
feature  to  integrate  two  wafers  with  various  crystallographic 
orientations.  For  (001)  InP  and  GaAs  wafers,  there  are  two 
ways  to  orient  the  samples  before  fusion.  One  is  in-phase 
fusion  [8],  which  is  shown  in  Fig.  1(b),  where  the  [110] 
axis  of  the  top  wafer  is  peipendicular  to  the  [110]  axis  of 
the  bottom  wafer.  This  structure  is  equivalent  to  that  grown 
by  heteroepitaxy.  The  other  one  is  antiphase  fusion  which 
is  shown  in  Fig.  1(a),  where  the  [110]  axes  of  two  wafers 
are  parallel.  This  structure  cannot  be  realized  using  epitaxial 
growth  techniques.  Macroscopically,  the  lattice  structure  of 
antiphase  fused  material  has  inversion  symmetry  at  the  fusion 
interface.  It  is  obvious  that  the  push-pull  vertical  coupler 
switch  requires  antiphase  fusion  so  that  the  applied  vertical 
electrical  field  induces  opposite  index  change  in  the  two 
waveguides.  Since  some  of  the  electrical  and  optical  properties 
of  crystals,  as  well  as  their  processing  characteristics  depend 
on  the  crystal  orientation,  wafer  fusion  gives  an  extra  degree  of 
freedom  to  fabricate  new  types  of  devices.  For  example,  Yoo 
[9]  has  used  wafer  fusion  to  change  the  ciystal  orientation 
periodically  to  realize  quasi-phase-matched  second-harmonic 
generation  (SHG). 
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In-phase  F VC  Anti-phase 


Fig.  2.  I-V  curves  of  in-phase  and  anti-phase  FVC’s  and  the  electrolumi¬ 
nescence  images  under  80-mA  forward  current. 


In  this  letter,  we  study  two  kinds  of  FVC’s  fabricated  using 
in-phase  and  antiphase  fusion.  The  schematic  structure  of  the 
devices  is  shown  in  Fig.  1,  which  uses  the  technology  reported 
in  [10]  and  [11].  Two  wafers  were  grown  using  metal-organic 
chemical  vapor  deposition  (MOCVD).  For  the  first  wafer,  on 
n+  (001)  InP  substrate,  a  0.5-^m  InGaAsP  (A^  =  1.3  (J.m) 
guiding  layer,  followed  by  0.1  -^m  InP  cladding  layer,  20- 
nm  InGaAsP  (A^  =  1.15  n m)  etch  stop  layer  and  0.4-/xm 
InP  coupling  layer  were  grown.  All  layers  were  undoped. 
The  second  wafer  was  grown  on  p+  (001)  InP  substrate. 
It  consists  of  0.2-/zm  p+  InGaAs  layer,  followed  by  2  fi m 
p  (5  x  1017/cm3)  InP  layer  and  the  same  intrinsic  InGaAsP 
and  InP  layers  as  the  first  wafer.  The  last  190  nm  of  the 
bottom  2-fxm  InP  layer  was  undoped  to  avoid  Zn  diffusion  to 
the  quaternary  layer  during  the  end  of  the  growth  and  wafer 
fusion.  The  0.2-ixm  p+  InGaAs  layer  was  used  as  an  etch 
stop  layer  to  remove  the  substrate.  The  device  fabrication 
starts  by  cleaving  two  approximately  8  x  12  mm2  samples 
from  each  of  the  grown  wafers.  The  top  0.4-/zm  InP  layer  of 
p+  samples  is  removed.  On  n-b  samples,  a  ridge  waveguide 
structure  with  2-3-^m  width  along  the  [110]  direction  is 
formed  using  standard  photolithography  and  selective  wet 
etching  techniques.  The  [110]  direction  was  chosen  since  HC1 
etchant  produces  straight  side  walls  in  this  direction.  The  n+ 
and  p-j-  samples  are  then  fused  together  at  a  temperature  of 
630  °C  in  a  hydrogen  atmosphere  for  50  min.  In  one  case 
(fused  sample  A),  the  p+  sample  was  oriented  so  that  its  [1 10] 
direction  was  parallel  to  the  waveguides  on  n-b  substrate  (i.e., 
antiphase  fusion).  For  fused  sample  B,  the  orientation  of  the 
p-b  sample  was  chosen  to  get  the  in-phase  fusion.  After  fusion, 
p+  InP  substrates  for  both  samples  are  removed  using  HC1 
etching.  Standard  ohmic  contacts  were  formed  on  both  sides 
of  the  wafers  to  be  able  to  apply  a  bias.  Fig.  1  shows  the  stain 
etched  scanning  electron  microscope  (SEM)  pictures  for  both 
antiphase  (A)  and  in-phase  (B)  FVC’s. 


Wavelength  (nm) 
(a) 


Wavelength  (nm) 

(b) 

Fig.  3.  The  light  intensity  at -the  output  of  upper  (solid  line)  and  lower 
(dash  line)  waveguides  of  (a)  antiphase,  (b)  in-phase  FVC’s  as  a  function 
of  wavelength. 


Fig.  2  shows  the  current  versus  voltage  (I-V)  curves  of  the 
samples  A  and  B.  The  device  size  is  about  7  mm  x  3  ^m. 
There  is  a  small  forward  voltage  drop  of  I  V  at  the  fusion 
interface.  The  leakage  current  of  sample  A  at  reverse  biases  is 
a  little  higher  than  that  of  sample  B.  Under  forward  bias,  the 
luminescence  images  of  anti-  and  in-phase  fused  devices  are 
found  to  be  similar  as  it  can  be  seen  in  Fig.  2. 

To  characterize  FVC’s,  a  tunable  laser  source  is  used  to 
input  TE-polarized  light  through  a  lensed  single-mode  fiber 
(SMF).  The  image  at  the  output  of  coupler  facet  is  recorded 
with  an  IR  camera  with  an  80  x  objective.  The  samples  are 
mounted  to  a  temperature  stabilized  stage.  First,  the  passive 
switching  of  the  two  FVC  structures  was  characterized  by 
changing  the  input  wavelength.  In  response,  the  effective 
coupling  length  changes  and  the  output  light  switches  between 
upper  and  lower  waveguides  (Fig.  3).  The  oscillation  period  is 
a  function  of  the  coupling  strength  between  two  waveguides 
and  the  total  length  of  the  couplers.  The  difference  in  the 
measured  oscillation  period  of  sample  A  (11  nm)  and  sample 
B  (13  nm)  matches  very  well  with  the  different  length  of 
the  couplers  (sample  A:  6.9  mm  long,  sample  B:  5.9  mm 
long).  This  shows  that  samples  A  and  B  have  the  same 
coupling  length.  Because  of  the  variation  in  amplitude,  the 
half  period  looks  different  at  different  wavelength  (the  “eye” 
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Fig.  4.  The  light  intensity  at  the  output  of  upper  (solid  line)  and  lower 
(dashed  line)  waveguides  of  (a)  antiphase  and  (b)  in-phase  FVC’s  as  a  function 
of  reverse  bias  voltage. 

is  big  or  small),  but  the  actual  oscillation  period  measured 
from  a  distance  between  consecutive  minimums  is  within 
our  experimental  resolution  (zbl  nm).  Then  a  reverse  bias 
is  applied  to  both  samples.  The  normalized  intensities  at  the 
output  of  upper  and  lower  waveguides  as  a  function  of  bias 
voltage  are  shown  in  Fig.  4(a)  (sample  A)  and  (b)  (sample 
B).  The  antiphase  FVC  switches  at  a  bias  of  12  V  while  no 
switching  is  observed  for  in-phase  fused  sample  B. 

It  is  known  that  the  mechanisms  of  index  change  in  p-i-n 
structures  include  the  linear  electrooptic  (LEO)  effect,  the 
quadratic  electrooptic  (QEO)  effect  and  the  free-carrier  ef¬ 
fect  due  to  the  modulation  of  the  depletion  layer.  In  the 
current  structure,  because  of  a  thick  1.6-yum  intrinsic  layer 
and  because  the  operation  wavelength  is  far  away  from  the 


bandgap,  the  QEO  and  free-carrier  effects  are  very  weak. 
Furthermore,  QEO  and  free-carrier  effects  should  be  the  same 
for  both  samples,  because  they  are  independent  of  the  ciystal 
orientation.  Therefore,  the  LEO  effect  dominates  in  current 
FVC  structures.  The  switching  in  sample  A  is  because  of  the 
push-pull  configuration  which  comes  from  the  inverted  crystal 
orientation.  In  sample  B,  the  index  change  in  top  and  bottom 
waveguides  is  the  same,  so  the  switching  requires  a  much 
higher  voltage. 

In  conclusion,  a  push-pull  fused  vertical  coupler  switch  with 
crystal  inversion  symmetry  has  been  demonstrated.  Switching 
under  12-V  reverse  bias  is  observed  for  antiphase  FVC’s  while 
there  is  no  switching  for  in-phase  FVC’s.  The  wafer  fusion 
technique  simplifies  the  complicate  electrode  configuration 
in  conventional  push-pull  type  couplers.  Additionally,  other 
novel  devices  can  be  realized  using  different  crystal  structures 
integrated  together. 
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As  with  our  earlier  devices,  the  structures  are  ridge-guided  opti¬ 
cal  waveguide  rings  formed  in  GaAIAs/GaAs  double  quantum 
well  material.  The  ridge  width  is  2jim  and  the  cavity  lengths  arc 
7.35  and  7mm.  When  the  laser  is  at  threshold,  the  cavity  losses  arc 
compensated  for  by  the  gain  and  an  optimum  value  of  k2  -  0.5% 
is  found.  The  rings  are  placed  one  within  another,  lightly  coupled 
by  a  directional  coupler  section  (Fig.  2).  The  outer  one  is  coupled 
via  a  multimode  interference  (MM1)  coupler  to  a  straight  output 
waveguide.  The  MMI  coupler  is  6pm  wide  and  370pm  long,  and 
has  been  designed  for  a  -3dB  split  (kx  =  ^0.5).  The  directional 
coupler  between  the  two  resonant  rings  has  a  minimum  waveguide 
separation  of  4pm  to  give  a  much  smaller  coupling  ratio;  the 
design  has  been  simulated  by  BPM,  which  gives  k2  =  0.03.  The 
output  waveguide  is  tilted  at  5°  to  the  cleaved  sides  of  the  sub¬ 
strate  to  reduce  reflections  back  into  the  resonator.  Separate  con¬ 
tacts  are  made  to  the  inner  ring  and  to  the  outer  ring. 


Fig.  2  Layout  of  double-ring  laser ;  inner  and  outer  ring  directionally 
coupled  (weakly);  MM1  coupling  to  output  waveguide 


to  the  inner  ring  and  increasing  the  drive  current  by  degrees,  the 
multiple  lasing  resonances  of  the  outer  ring  are  gradually  sup¬ 
presses  and  are  replaced  with  wider-spaced  resonances,  the  fre¬ 
quency  spacing  of  which  corresponds  to  the  free  spectral  range 
(FSR)  of  the  two-ring  resonant  structure,  as  shown  in  Fig.  3  for 
an  inner-ring  drive  current  of  900mA.  Two  lasing  modes  are 
apparent  here;  the  measured  frequency  separation  of  ~0.7  nm  is 
near  to  the  calculated  FSR  of  the  double  ring  resonator  of 
0.64nm.  The  centre  frequency  of  the  spectrum  has  moved  to  a 
longer  wavelength.  We  think  this  is  because  the  inner,  lower  loss, 
ring  cavity  has  a  gain  peak  at  a  longer  wavelength,  due  to  a  lower 
carrier  density  at  threshold  altering  the  refractive  index. 

The  filter  theory  used  to  describe  the  coupled  ring  resonant  cir¬ 
cuit  is  linear,  excluding  saturation,  and  is  of  limited  value  when 
applied  to  the  lasing  structure,  but  confirms  the  experimental 
behaviour.  The  need  for  a  small  coupling  ratio  has  been  empha¬ 
sised  by  numerical  simulation  based  on  the  model  of  Lang  and 
Kobayashi  [4].  Dynamic  behaviour  ranging  from  self-pulsation  to 
coherence  collapse  was  found  for  a  master-slave  in-line  semicon¬ 
ductor  laser  system  [5],  for  which  dynamically  stable  operation 
was  only  obtained  with  a  coupling  ratio  <  1%.  In  that  analysis  the 
lasers  were  coupled  through  an  isolator,  although  subsequent  cal¬ 
culations  on  a  similar  system  but  omitting  the  isolator  generated 
similar  results.  We  plan  to  extend  that  dynamic  analysis  to  the 
present  case,  to  obtain  a  realistic  model  of  the  double-ring  laser. 
Further  designs  of  double-ring  resonator  are  being  fabricated,  to 
be  used  either  as  passive  transmission  filters  or  as  lasers.  Opera¬ 
tion  outside  the  regimes  of  stability  should  be  possible,  where  we 
hope  to  observe  self-pulsation  and  coherence  collapse. 


Fig.  3  Double  ring  laser  -  output  spectrum  with  Iexl  =  900mA  and  Iml  =  0 
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Vertical  coupler  with  separated  inputs  and 
outputs  fabricated  using  double-sided 
process 

B.  Liu,  A.  Shakouri,  P.  Abraham  and  J.E.  Bowers 

A  novel  vertical  coupler  used  as  a  wavelength  multiplexer  with 
separated  input  and  output  waveguides  is  demonstrated.  The 
wafer  fusion  technique  enables  both  frontside  and  backside 
processing  of  the  thin  epitaxial  films.  A  vertical  coupler  with  a 
4mm  interaction  region  demonstrates  20 rum  wavelength  spacing. 


The  devices  have  been  tested  by  biasing  the  outer  ring  above 
threshold  and  observing  how  the  biasing  of  the  inner  ring  affects 
the  output  of  the  device.  With  a  forward  current  of  900mA 
applied  to  the  outer  ring  (1.5  times  threshold),  a  multimode  spec¬ 
tral  output  was  observed,  as  in  Fig.  3.  By  applying  a  forward  bias 


Introduction:  Directional  waveguide  couplers  are  fundamental 
components  in  photonic  integrated  circuits  used  in  optical  commu¬ 
nications  systems.  They  can  be  used  as  optical  switches,  splitters, 
modulators  and  narrowband  filters  [1  -  4].  Conventional  direc¬ 
tional  couplers  are  planar  where  the  two  waveguides  arc  horizon¬ 
tally  (laterally)  arranged.  One  disadvantage  of  the  planar  design  is 
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thai  the  spacing  between  two  coupled  waveguides  is  limited  by  the 
abneahon  techniques.  This  results  in  long  coupling  lengths 
(around  several  millimetres)  [1  -3].  Vertical  directional  couplers, 
on  the  other  hand,  have  a  very  short  coupling  length  that  is  even 

lhan  ^  spacing  between  two  vertically 

stacked  waveguides  can  be  reduced  to  <  0.5pm,  and  this  can  be 
precisely  controlled  by  the  material  growth.  However  the 
two  input  or  output  waveguides  in  vertical  couplers  are  so 
close  (<  1pm)  together  that  the  direct  coupling  of  individual 
waveguides  to  fibres  is  very  difficult.  This  has  limited  the  practical 
applications  of  vertical  couplers  in  fibre-optic  systems.  To  solve 
this  problem,  recently  a  novel  fused  vertical  coupler  [6,  7]  based  on 
the  wafer  fusion  technique  [8]  has  been  proposed  where  the  two 
strongly  coupled  waveguides  arc  fabricated  on  two  different  sub¬ 
strates  and  separated  laterally. 


Fig.  1  Coupler  fabrication  steps 


a  Conventional  processed  epitaxial  layer  structure 
c  After*’ 'fll2UrC  iS  ^edarid  bonded  to  another  host  substrate 
structure™ pr^d'®1  substrate.  “r>oscd  backside  of  epitaxial 


SiN 

O-S/iim  InP 
0.2umInP 
0.5um  InGaAsP 
0.6um  InP 
0.5  pm  InGaAsP 
O^pmlnP 
0.8mn  InP 
fusion  interface 


Fig.  3  SEM  picture  of  vertical  coupler  fused  to  InP  substrate 


In  this  Letter,  we  demonstrate  a  vertical  coupler  with  separated 
mput  and  output  waveguides  fabricated  using  both  frontside  and 
backside  processing  of  the  same  substrate.  Using  wafer  fusion  a 
conventionally  processed  epitaxial  layer  structure  (Fig.  \a)  ’ is 
inverted  and  bonded  to  a  new  host  substrate  (Fig.  lb).  After 
removing  the  top  substrate  and  leaving  only  the  thin  epitaxial  film 
bonded  to  the  host  material,  the  exposed  backside  of  the  epitaxial 
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}hQn  **  Passed  as  well  (Fig.  Ic).  This  technique 
enables  the  fabrication  of  3D  photonic  integrated  circuits. 

Experiment:  The  schematic  drawing  in  Fig.  2  shows  a  vertical 
coupler  with  separated  inputs  and  outputs,  where  the  two 
wavegutdes  are  coupled  vertically  and  separated  horizontally  in 
different  planes.  The  device  fabrication  begins  with  a  metal- 
organic  chemical  vapour  deposition  (MOCVD)-grown  structure 
5Flf-  A\  1 **ieh  includes  a  0.8pm  InP  frontside  ridge  layer,  a  15nm 
InGaAsP  (bandgap  1.1pm)  etching  stop  layer,  a  0.2pm  InP  cap 
layer,  a  0  5pm  InGaAsP  (bandgap  1.3pm)  frontside  guiding  layer, 
a  0.6pm  InP  coupling  layer,  the  same  backside  guiding,  cap,  etch 
stop  and  ndge  layers,  and  finally  a  0.2pm  InGaAs  for  substrate 
removal  on  InP  substrate.  First,  the  3pm  width  frontside  ridge 
waveguides  are  formed  by  reactive  ion  etching  (RIE)  and  chemical 
wet  etching.  The  frontside  guiding  layer  above  the  backside 
waveguide  structure  in  non-coupling  areas  is  removed  by  further 
wet  etching.  The  waveguide  sample  is  then  inverted  and  fused  to  a 
bare  InP  substrate  under  pressure  for  40min  at  630°C.  After 
removing  the  top  substrate  and  InGaAs  etch  stop  layer,  the  back¬ 
side  waveguides  are  fabricated  and  the  unnecessary  guiding  layer 
above  the  frontside  waveguide  structure  is  removed  as  before  The 
waveguide  alignment  is  facilitated  using  infrared  photolithogra- 


Fig.  4  Light  output  intensity  against  wavelength 

— O —  right  waveguide 
^ - left  waveguide 

Near  field  images  at  outputs  are  also  shown 


Fig.  3  shows  the  SEM  picture  of  the  coupling  region.  In  our 
former  fused  vertical  couplers  [6],  the  fused  interface  was  between 
the  two  guiding  layers.  This  introduced  an  excess  optical  loss  due 
to  imperfections  at  the  non-epitaxial  interface.  To  separate  the 
input  and  output  waveguides,  it  was  necessaiy  to  align  the  two 
wafers  with  submicrometre  resolution  during  the  wafer  fusion 
process.  In  our  current  work,  the  fused  interface  is  not  important 
smoe  it  is  far  away  from  the  guiding  layer.  Also,  no  alignment  is 
necessary  during  the  fusion  process. 


vucS  and  dLscusslons:  To  Characterise  the  device  performance 
light  from  a  tunable  laser  was  coupled  to  an  input  waveguide  by  a 
singlemode  fibre.  The  light  at  the  output  was  collected  by  another 
smglemode  fibre  which  was  connected  to  a  detector.  Fig  4  shows 
toe  measured  output  light  intensity  as  a  function  of  wavelength. 
The  near  field  images  at  the  output  were  recorded  by  an  IR  cam- 
era  with  a  Jens  and  they  are  shown  in  Fig.  4.  The  total  device 
length  is  8  mm,  toe  coupler  length  is  4mm  and  the  separation 
between  two  waveguides  is  20pm.  As  we  expected,  the  output 
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intensity  is  a  periodic  function  of  wavelength  and  the  oscillation 
period  is  ~40nm.  This  means  that  the  device  can  be  used  as  a 
wavelength  multiplexer/demultiplexer  for  20nm  wavelength  spac¬ 
ing.  This  small  wavelength  spacing  is  very  difficult  to  achieve  in  a 
conventional  horizontally  arranged  coupler  because  of  the  weak 
wavelength  dependence  of  the  coupling  coefficient  [9].  By  cascad¬ 
ing  several  strongly  coupled  vertical  couplers  together,  multi-chan¬ 
nel  multiplexers  with  several-nanometre  sparing  can  be  realised. 

It  is  worth  noting  that  in  the  processing  of  both  sides  by  the 
wafer  fusion  technique  only  one  epi-wafer  is  needed  and  another 
host  wafer  can  be  of  any  material,  such  as  Si,  glass,  etc.  This  will 
reduce  the  cost  and  give  more  flexibility  for  the  fabrication  of  3D 
photonic  integrated  circuits. 

Conclusion:  We  have  successfully  realised  a  vertical  coupler  with 
separated  input  and  output  waveguides  by  using  wafer  fusion 
technology.  This  device  can  be  used  as  a  wavelength  multiplexer  in 
fibrc-optic  systems.  20mn  channel  sparing  has  been  achieved  for  a 
vertical  coupler  with  4mm  coupling  region.  Wafer  fusion  technol¬ 
ogy  enables  both  frontside  and  backside  processing  of  thin  epitax¬ 
ial  films  and  is  an  important  tool  for  realising  novel  devices  and 
3D  photonic  integrated  circuits. 
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Compact  optoelectronic  oscillator  with  ultra- 
low  phase  noise  performance 

Yu  Ji,  X.S.  Yao  and  L.  Maleki 

■  A  compact  optoelectronic  oscillator  (OEO)  is  presented  which  is 
using  constructed  using  a  DFB  laser,  a  semiconductor  Mach- 
Zehnder  modulator,  and  a  dielectric  resonator  based  RF  filter. 
The  achieved  phase  noise  is  -50dBc/Hz  at  1 0Hz  and  -130dBc/Hz 
at  10  kHz  from  a  10GHz  oscillation  frequency,  a  performance 
comparable  to  that  of  an  OEO  constructed  with  a  diode-pumped 
YAG  laser  and  an  LiNbCb  modulator. 

Introduction:  Optoelectronic  oscillators  (OEOs)  have  the  capability 
of  achieving  ultra-low  phase  noise  for  both  microwave  and  optical 
communications  (1).  The  oscillation  in  an  OEO  is  produced  via  a 
feedback  loop  which  includes  a  modulator,  an  optical  fibre  delay 
element,  and  a  photodetector.  However,  the  previously  demon¬ 
strated  OEOs  were  constructed  with  expensive  and  bulky  diode- 
pumped  YAG  lasers,  LiNb03  modulators,  and  cavity  RF  filters. 
For  communications,  radar,  and  space  applications,  compact  and 
low  cost  OEOs  are  preferred.  We  report  in  this  Letter  a  compact 
and  high  performance  OEOs  constructed  with  an  integrated  mod¬ 
ule  consisting  of  a  DFB  laser  and  a  semiconductor  modulator,  and 
a  dielectric  resonator  based  RF  filter.  The  experimental  results 
demonstrate  that  low  cost  semiconductor  lasers  and  modulators, 
together  with  low  cost  dielectric  resonators,  can  be  used  to  con¬ 
struct  optoelectronic  oscillators. 

2cmx4cm 


fibre 

delay 


Ireni 

Fig.  1  Configuration  of  compact  OEO 

Configuration  of  compact  OEO:  The  configuration  of  the  compact 
10GHz  OEO  is  shown  in  Fig.  1.  We  used  a  DFB  laser  integrated 
with  a  semiconductor  Mach-Zehnder  modulator.  The  measured 
RIN  (relative  intensity  noise)  of  the  laser  output  is  -1  lOdBc/Hz  at 
10Hz  and  -13SdBc/Hz  at  10 kHz,  which  sets  a  limit  on  the  phase 
noise  of  the  OEO. 

The  key  to  the  low  phase  noise  performance  of  an  OEO  is  the 
long  optical  fibre  loop  delay.  The  highest  spectral  purity  signals 
with  the  OEO  are  achieved  with  the  longest  fibre  length  [1]. 
Because  the  OEO  is  essentially  a  multimode  device,  with  its  mode 
sparing  inversely  proportional  to  the  length  of  the  fibre,  achieving 
ultra-low  phase  noise  requires  a  filter  with  narrow  enough  band¬ 
width  to  select  a  mode  for  operation  at  a  single  frequency. 
Although  a  multi-loop  scheme  can  be  used  for  singlcmodc  selec¬ 
tion  [2],  it  increases  the  complexity  and  size  of  the  OEO.  In  this 
Letter,  we  use  an  ultra-narrow-bandwidth  filter  constructed  with  a 
dielectric  resonator  (DR)  for  singlemode  selection. 

'  Dielectric-resonator-loaded  high-Q  narrowband  filters  can  be 
designed  to  occupy  a  total  volume  only  ~5%  of  that  of  a 
waveguide  filter  with  an  equivalent  performance  and  their  Q  at 
room  temperature  can  be  as  high  as  104.  In  addition,  the  tempera¬ 
ture  coefficient  of  the  filter  is  exceptionally  low,  down  to  ±1  ppm/ 
°C  at  room  temperature  [3].  We  designed  and  fabricated  such  a  fil¬ 
ter  by  placing  an  8.7mm  x  4mm  high  dielectric  constant  (t,  =  30) 
ceramic  cylindrical  disc  at  the  centre  of  an  aluminium  cavity  with 
a  size  three  times  that  of  the  DR  disc.  Using  a  design  tool  based 
on  the  finite  element  method,  we  found  that  the  TEs  ^  mode  has 
a  frequency  of  10  GHz,  and  more  than  90%  of  the  energy  can  be 
confined  in  the  disc.  To  obtain  optimum  mode  matching,  a  tiny 
wire  loop  probe  was  used  for  both  mode  excitation  and  coupling. 
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A  100-kHz  to  50-GHz  Traveling- Wave 
Amplifier  IC  Module 

Volkan  Kaman,  Student  Member,  IEEE,  Tom  Reynolds,  Anders  Petersen,  and  John  E.  Bowers,  Fellow,  IEEE 


Abstract — Broad-band  microwave  packaging  of  a  traveling- 
wave  amplifier  with  an  on-chip  bandwidth  of  2-50  GHz  is 
described.  Techniques  to  reduce  overall  insertion  losses  and  to 
extend  the  low-frequency  cutoff  of  the  amplifier  while  maintain¬ 
ing  gain-flatness  are  discussed.  The  packaged  amplifier  modules 
exhibit  excellent  performance  from  100  kHz  to  50  GHz.  They  are 
demonstrated  as  modulator  drivers  and  receiver  amplifiers  in  a 
30-Gbit/s  digital  optical  communication  system. 

Index  Terms — Coplanar  waveguides,  MMIC  amplifiers,  optical 
communication,  packaging,  traveling-wave  amplifiers. 


width  of  2-50  GHz  [1 1].  We  describe  packaging  techniques  on 
reducing  the  overall  insertion  losses  as  well  as  the  design  of 
the  drain  bias  circuitry  to  extend  the  low-frequency  cutoff  of 
the  amplifier  down  to  100  kHz  while  maintaining  gain-flatness 
over  a  broad  bandwidth.  The  packaged  modules  were  used  as 
modulator  drivers  and  receiver  amplifiers  in  a  30-Gbit/s  optical 
transmission  system  [12]. 

II.  Amplifier  Packaging 


I.  Introduction 

THE  SPEED  of  optical  fiber  telecommunication  systems 
based  on  electrical  time  division  multiplexing  has  in¬ 
creased  rapidly  over  the  last  decade.  Systems  operating  at 
10  Gbit/s  are  beginning  to  be  deployed  while  the  expected 
generation  of  40  Gbit/s-per-wavelength  optical  commu¬ 
nication  systems  has  already  been  demonstrated  in  various 
laboratory  experiments  [1],  [2].  Baseband  amplifiers  used  as 
modulator  drivers,  preamplifiers,  and  postamplifiers  are  key 
components  for  such  high-speed  systems.  A  linear  response 
with  little  magnitude  variation  and  a  constant  group  delay 
over  a  broad  bandwidth  is  essential  for  low  bit  error  rates 
with  long  pattern  lengths.  Amplifiers  with  these  characteristics 
have  been  demonstrated  in  various  integrated  circuit  (IC) 
technologies  (see  for  example  [3]— [5]).  The  packaging  of 
high-speed  ICs  is  a  real  challenge  since  the  overall  package 
parasitics  (the  transmission  line,  discontinuities,  wire  bonds 
and  lumped  elements  used  in  the  module)  become  significant 
at  high  frequencies  and  can  severely  limit  the  performance 
of  the  broadband  IC  being  packaged.  Some  very  impressive 
high-speed  modules  have  been  demonstrated  for  in-house  IC’s 
either  by  optimizing  the  module  for  the  given  IC  [6]-[8]  or  by 
designing  the  IC  according  to  the  known  package  parasitics 
[91  [10].  ^ 

In  this  letter,  we  present  the  packaging  of  a  commercially 
available  traveling-wave  amplifier  that  has  an  on-chip  band- 
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A.  IC  Characteristics 

The  frequency  response  for  on-wafer  measurements  and 
other  characteristics  for  the  commercial  amplifier  are  given 
in  [11].  Bonding  pads  are  provided  to  allow  the  amplifier  to 
operate  at  frequencies  lower  than  2  GHz  and  to  supply  the 
drain  and  gate  biases  by  means  of  external  circuit  components. 
However,  since  the  output  of  the  amplifier  is  directly  connected 
to  the  drain  bias  pad,  any  parasitic  coupling  in  the  drain 
bias  circuitry  reflects  as  resonances  and  frequency  response 
deterioration.  In  addition  to  these  constraints,  the  input  and 
the  output  of  the  amplifier  need  to  be  externally  AC-coupled 
without  introducing  excess  insertion  loss. 

B.  Package  Design 

A  coplanar  waveguide  (CPW)  transmission  line  was  chosen 
for  the  ease  of  providing  a  low-inductance  ground  between  the 
chip  and  the  surface  ground  of  the  substrate.  A  quartz  substrate 
was  used  to  minimize  CPW  radiation  and  skin  losses  since  it 
has  a  low  dielectric  constant  {er  =  3.92).  In  order  to  suppress 
cavity  resonances  and  microstrip  modes,  the  quartz  substrate 
was  suspended  over  an  air  gap  with  a  microwave  absorber. 
The  3350-/zm-long  identical  substrates  used  at  the  input  and 
the  output  of  the  amplifier  were  designed  to  have  minimum 
insertion  and  return  losses.  Fig.  1  shows  a  photograph  of  the 
packaged  IC. 

An  important  package  parasitic  comes  from  the  wire  bond 
connection  between  the  chip  and  the  CPW  transmission  line, 
which  introduces  inductance  for  both  the  signal  and  the 
ground  connections.  It  was  shown  in  [9]  that  this  parasitic 
could  be  reduced  by  designing  the  IC  according  to  the  outer 
bonding  wire  inductance.  However,  this  technique  is  not 
possible  for  the  packaging  of  a  commercial  IC;  therefore,  an 
optimization  of  this  interconnection  is  required.  We  observed 
experimentally  that  gold  ribbon  with  a  width  of  75  ^m  has 
less  inductance  than  the  conventionally  used  wire  bonds.  The 
bonding  pads  of  the  IC  were  aligned  with  the  top  surface  of 
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Fig.  1.  Photograph  of  the  packaged  amplifier  IC 


the  substrate  to  minimize  the  length  and  the  inductance  of  the 
ribbon.  The  length  of  the  ribbons  was  approximately  150  fxm 
long  with  an  estimated  inductance  of  less  than  1  nH/mm. 

The  AC-coupling  of  the  input  and  the  output  of  the  amplifier 
are  conventionally  achieved  by  mounting  a  surface  mount 
dielectric  (SMD)  capacitor  across  a  gap  on  the  signal  line. 
However,  since  broad-band  (100  kHz  to  50  GHz)  SMD’s 
are  commercially  unavailable,  a  parallel  combination  of  a 
180-pF  dielectric  capacitor  (di-cap)  and  a  33-nF  SMD  was 
chosen  [Fig.  2(a)].  In  order  to  simulate  the  input  and  output 
substrates  simultaneously,  a  pair  of  the  capacitor  combination 
was  epoxied  on  a  7-mm-long  CPW  line.  Frequency  response 
measurements  showed  that  there  was  about  2.4-dB  extra  inser¬ 
tion  loss  due  to  the  two  gaps  and  the  capacitor  combinations  at 
40  GHz.  In  order  to  eliminate  the  discontinuities  on  the  CPW, 
the  module  was  designed  such  that  the  capacitors  would  be 
placed  directly  under  the  K-connector  launch  [Fig.  2(b)].  This 
new  design  improved  the  insertion  loss  by  1  dB  at  40  GHz 
and  allowed  for  slightly  shorter  CPW  lines. 

A  similar  parallel  configuration  of  capacitors  was  connected 
to  the  low-frequency  extensions  of  the  amplifier.  The  parasitics 
of  the  capacitors  or  the  gold  ribbon  were  not  seen  to  be  sig¬ 
nificant  on  the  frequency  response  of  the  amplifier.  However, 
at  frequencies  below  2  GHz,  a  gain  rise  was  caused  due  to 
the  increase  in  the  drain  termination  resistor  value  from  50  to 
65  ([11,  Fig.  1]).  This  is  overcome  in  the  drain  bias  circuitry 

as  will  be  described  in  the  following  section. 

HI.  Drain  Bias  Circuit  Design 

The  challenge  of  the  drain  bias  circuit  design  was  to  use 
commercial  components  (with  limited  bandwidth  and  uncer¬ 
tain  parasitics)  while  also  incorporating  an  external  resistor 
to  reduce  the  low-frequency  gain;  it  was  essential  that  the 
whole  circuit  looked  like  an  open  from  the  output  pad  up  to 
50  GHz.  This  design  turned  out  to  be  the  most  crucial  aspect 
of  the  package  since  any  coupling  of  the  parasitics  resulted 


K-connector 


/ 


K-connector 

/ 


Fig.  2.  Illustrative  diagrams  of  different  AC-coupling  schemes:  (a)  capacitor 
combination  mounted  on  the  signal  line  next  to  a  gap.  Ribbon  bonds  are  used 
to  connect  the  capacitor  and  the  signal  line  over  die  gap  and  (b)  capacitor 
combination  mounted  under  the  K-connector  launch,  thereby  eliminating  the 
gap  (signal  line  discontinuity)  and  the  ribbon  bonds. 


in  huge  resonances  in  the  frequency  response  of  the  module. 
The  circuit  schematic  that  gave  a  consistent  fiat  response  for 
a  broad  bandwidth  is  shown  in  Fig.  3.  The  inductor/resistor 
combination  Ll/Rl  through  L4  ensures  gain-flatness  from 
100  kHz  to  2  GHz.  It  is  significant  to  mention  that  the 
components  were  chosen  by  simulation  to  minimize  any 
resonance  between  a  pure  inductor  and  the  preceding  parasitic 
capacitance.  In  order  to  minimize  the  coupling  between  the 
ribbon  inductance  (L6)  and  its  preceding  parasitic  capacitance 
(which  resonates  strongly  in  the  gigahertz  range),  an  inductor 
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Fig.  3.  The  drain  bias  circuitry  used  for  biasing  the  amplifier.  R5  is  the 
low-frequency  gain  reduction  thin-film  resistor  patterned  on  a  quartz  substrate. 


Fig.  4.  Measured  insertion  and  return  losses  of  the  amplifier  module. 


packaged  amplifier’s  response  is  in  good  agreement  with  the 
chip  performance  with  the  exception  of  a  resonance  at  45  GHz 
and  a  deterioration  of  the  return  losses.  The  resonance  around 
45  GHz  is  due  to  the  bandwidth  limit  of  the  K-connectors 
used,  which  will  be  improved  by  using  V-connectors.  We  also 
believe  that  the  return  loss  will  be  improved  once  broad-band 
capacitors  for  ac-coupling  are  available.  The  amplifier  modules 
were  used  as  modulator  drivers  and  receiver  postamplifiers 
in  a  30-Gbit/s  NRZ  transmission  experiment  [12].  Fig.  5(a) 
shows  the  30-Gbit/s  input  and  (b)  the  3  V,,.,,  output  of  three 
cascaded  amplifier  modules.  No  significant  degradation  of  the 
eye  is  observed.  Bit  error  rate  measurements  confirmed  this 
with  error  free  operation  for  input  signals  above  -22  dBm. 

V.  Conclusion 

In  summary,  we  presented  the  microwave  packaging  of 
a  commercially  available  traveling-wave  amplifier  with  an 
on-chip  bandwidth  of  2-50  GHz.  By  using  techniques  to 
reduce  excess  insertion  loss  and  a  careful  design  of  the  drain 
bias  circuitry,  a  bandwidth  of  100  kHz  to  50  GHz  was 
achieved.  These  amplifier  modules  are  suitable  for  high  bit-rate 
transmission  systems,  which  was  successfully  demonstrated  in 
a  30-Gbit/s  transmission  experiment. 


(b) 


Fig.  5.  The  30-Gbit/s  transmission  performance  of  three  cascaded  amplifier 
modules:  (a)  30-Gbit/s  NRZ  input  (H:  13  ps/div,  V:  100  mV/div)  and  (b) 
30-Gbit/s  output  from  the  amplifiers  (H:  20  ps/div,  V:  750  mV/div). 

with  a  high  self-resonating  frequency  and  a  thin-film  resistor 
with  minimal  parasitics  and  a  small  bonding  pad  were  used. 
The  purpose  of  the  R5  thin-film  resistor  was  to  reduce  the  low- 
frequency  gain  described  in  the  preceding  section.  The  ribbon 
length  of  2.5  mm  was  very  critical  for  the  high-frequency 
(>2  GHz)  response  of  the  module.  It  was  chosen  to  be  long 
enough  to  act  as  a  broadband  high-frequency  inductor,  but 
short  enough  to  shift  the  resonance  due  to  the  preceding 
parasitic  capacitance  out  of  the  operational  bandwidth, 

IV.  Performance  of  Packaged  Amplifier 

Fig.  4  shows  the  measured  gain  and  return  losses  for  the 
module.  The  gain  is  at  an  average  of  8.5  dB  with  a  gain- 
flatness  within  ±1.5  dB  from  100  kHz  to  50  GHz.  The 
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Abstract.  Based  on  radiation  modes  and  phase  relations  in  different  ARROW  structures,  the  characteristic 
equations  are  presented  that  can  avoid  root  searching  in  the  complex  plane  and  find  the  effective  index, 
loss  and  field  profile  easily.  This  simple  model  gives  an  accurate  intuitive  picture  for  low  loss  leaky  modes 
and  it  can  be  used  to  design  and  optimize  the  low  loss  ARROW  devices  of  practical  interest. 


Key  words:  ARROW,  mode  characteristics,  optical  waveguides,  transfer  matrix 


1.  Introduction 

Antiresonant  reflecting  optical  waveguides  (ARROW)  (Duguay  et  al.  1986) 
have  attracted  a  great  deal  of  interest  during  the  past  several  years.  Instead  of 
total  internal  reflection  in  conventional  waveguides,  ARROWs  use  antires¬ 
onant  reflection  as  the  guiding  mechanism.  This  gives  ARROW  some  re¬ 
markable  features  that  are  used  in  many  applications,  such  as  remote 
couplers  (Gehler  et  al.  1994),  filters  (Chu  et  al.  1996),  and  polarization 
splitters  (Trutschel  1995).  In  order  to  design  ARROW  devices,  the  knowledge 
of  the  characteristics  of  ARROW  modes  is  important.  An  approximate  ex¬ 
pression  for  the  propagation  constant  and  loss  have  been  given  by  Baba  et  al. 
(1998)  and  Baba  and  Kokubun  (1992).  The  equivalent  transmission  line 
(Jiang  et  al.  1989)  and  the  transverse  resonance  method  (Huang  et  al.  1992) 
have  also  been  used  to  investigate  the  dispersion  and  loss.  A  rigorous  nu¬ 
merical  method  is  based  on  a  well  known  transfer  matrix  method  (Chilwell 
and  Hodgkinson  1984;  Kubica  et  al.  1990).  Because  of  the  leaky  property  of 
ARROW  modes,  the  solutions  must  be  found  in  the  complex  plane.  The  root 
searching  in  the  complex  plane  could  be  time  consuming  and  tedious,  espe¬ 
cially  for  optimization  of  multiple  layer  structures.  In  this  paper,  we  will  give 
a  set  of  simple  characteristic  equations  for  ARROW  structures.  The  root 
searching  for  the  new  equations  needs  to  be  carried  only  on  the  real  axis  and 
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the  error  is  negligible  for  low  loss  ARROW  modes  of  practical  interest  (loss 
smaller  than  a  few  dB/cm).  In  section  2,  characteristic  equations  of  different 
ARROW  structures  are  presented,  which  are  based  on  radiation  modes  and 
the  phase  relation  in  ARROW  devices.  Section  3  examines  the  modal  char¬ 
acteristics  such  as  effective  index,  loss  and  mode  profiles  in  most  of  the 
ARROW  structures  proposed  up  to  now. 

The  results  from  our  equations  and  the  exact  model  are  then  compared. 
Furthermore,  a  new  ARROW  filter,  which  combines  a  conventional  InP/ 
InGaAsP  waveguide  and  an  AlGaAs/GaAs  ARROW,  is  proposed  and 
simulated  in  Section  4. 


2.  Characteristic  equations 

We  consider  a  planar  multilayer  waveguide  as  shown  in  Fig.  1.  The  general 
solution  of  the  wave  equation  in  each  layer  is  well  known: 

EyJ  =  Aj  exp (kj(x  -  Xj))  +  Bj  exp ■(-*,(*  -  Xj))  (1 ) 

where  kj  =  y  /?*  -  tfnj,  Aj  and  Bj  are  the  complex  field  coefficients,  k0  is  the 
free  space  wavenumber,  fS  =  k0nc{r  is  the  propagation  constant,  «cff  is  the 
effective  index  and  Xj  is  the  position  of  layer  j.  By  imposing  the  continuity  of 
the  field  and  its  derivative  for  each  interface,  it  is  easy  to  find: 


Fig.  1.  Schematic  diagrams  of  a  planar  ARROW  structure  and  the  field  profile  (dash  line). 
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So  we  can  relate  field  coefficients  in  the  cover  ( Ac  and  Bc)  with  the  coefficients 
in  the  substrate  ( As  and  Bs): 


(2) 


t=tn---txtc  = 


tn 

h\ 


t\2 

til 


For  the  guiding  modes,  the  fields  should  be  evanescent  in  the  cap  and  the 
substrate  layers,  so  Ac  =  0  and  Bs  =  0  that  results  in  the  characteristic 
equation: 


=  0  (3) 

For  ARROW  modes,  since  they  are  leaky,  a  characteristic  equation 
tw(P)  =  0  can  be  found  by  assuming  outgoing  waves  in  the  cover  and 
substrate  layers,  with  correct  sign  of  kc  and  ks  chosen  (Chilwell  and 
Hodgkinson  1984).  The  root  fi  resides  in  the  complex  plane,  so  the  root 
searching  for  structures  with  many  layers  and  for  optimization  purposes 
could  be  time  consuming  and  tedious.  In  the  following,  based  on  physical 
arguments,  we  will  introduce  a  different  characteristic  equation  for  the 
radiation  modes,  that  gives  the  ARROW  mode  effective  indices  on  the 
real  axis. 

Radiation  modes  require  both  incoming  and  outgoing  components  to 
form  standing  waves  in  the  substrate  layer  for  one  sided  radiation  modes, 
or  in  both  cover  and  substrate  layers  for  two  sided  radiation  modes. 
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Equation  (1)  holds  for  radiation  modes  as  well.  Since  the  number  of  un¬ 
known  variables  is  larger  than  the  number  of  boundary  conditions,  a 
characteristic  equation  cannot  be  established.  A  simple  relation  between  As 
(Ac)  and  Bs  (Bc)  will  allow  us  to  get  a  characteristic  equation  for  the  case  of 
low  loss  ARROW  modes.  When  the  interference  layers  in  ARROW 
waveguides  satisfy  the  antiresonant  condition,  the  reflectivity  is  very  close 
to  unity  and  the  phase  shift  in  each  layer  is  90°.  This  assures  that  the  field 
at  the  outermost  interface  is  a  node  or  an  anti-node,  i.e.,  As  =  ±BS  or 
Ac  =  ±BC  for  one  sided  radiation  modes  and  As  —  ±BS,  Ac  —  ±BC  for  two 
sided  radiation  modes  from  Equation  (1).  The  sign  depends  on  the  index  of 
outermost  two  layers. 

Now,  let  us  look  at  each  case  separately. 

(a)  One  sided  ARROW  modes:  In  one  sided  ARROW,  the  field  in  one  side 
(cover)  is  evanescent  Bc  —  0  and  in  another  side  (substrate)  is  standing  wave 
(actually  the  amplitude  of  the  ‘mode’  is  increased  with  the  distance  because  of 
its  lossy  nature  (Peierls  1979),  i.e.  ne ft  >  nc  and  «Cff  <  ns.  When  the  substrate 
index  is  larger  than  that  of  the  last  layer,  ns  >  «N  (the  right  side  of  Fig.  1),  the 
field  in  the  outmost  interface  is  a  node,  As  =  -Bs.  From  Equation  (1)  we  get 
the  characteristic  equation: 


*n  +  '21  =  0 

When  the  index  of  substrate  is  smaller  than  that  of  the  last  film  layer  ns  <  nN, 
the  field  in  the  outmost  interface  is  a  anti-node  (the  left  side  of  Fig  1) 
AS=BS.  So 


'ii  -  *2i  =  0  (5) 

(b)  Two  sided  ARROW  modes:  In  two  sided  ARROW,  the  field  in  both  sides 
are  standing  waves,  i.e.  «cff  <  nc  and  nc fr  <  n%.  There  are  three  cases: 

•  «s  >  «N  and  nc>  n\.  In  this  case,  As  =  -Bs  and  Ac  =  -Bc,  so 

t\ !  “  t\2  +  h\  —  t22  =  0  (g) 

•  ns  >  «n  and  nc  <  (Fig.  1)  or  ns  <  nN  and  nc  >  nu  As  =  -Bs  and 
Ac  =  Be  or  A s  =  B%  and  Ac  =  -Bc,  so 


(7) 


ti\  +  t\2  +  t2\  +  ti2  =  0 

•  ns  <  «N  and  nc  <  n\,  As  =  Bs  and  Ac  —  Bc,  so 
h  1  +  *12  ~  *21  +  *22  —  0 


(8) 
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Equations  (3)-(8)  are  the  characteristic  equations  for  all  kinds  of  ARROW 
structures.  For  all  of  these  the  effective  index  for  guided  (Equation  (3))  and 
leaky  modes  (Equations  (4)-(8))  for  lossless  materials  can  be  found  on  the 
real  axis.  After  finding  the  mode’s  effective  index,  it  is  simple  to  get  its 
profile  if  we  choose  the  correct  Ac  and  Bc.  Based  on  the  first  order  per¬ 
turbation  theory,  the  loss  of  leaky  modes  can  be  found:  a  =  4.34/co 
Im  [/]  i  («cff) / f7)  i  («cff)]  dB/pm,  where  1m  is  the  imaginary  part,  /n(ncff)  is  the 
differential  with  respect  to  ncff- 


3.  Examples 

In  order  to  check  the  accuracy  of  our  characteristic  equations  for  different 
ARROW  structures,  we  have  compared  the  calculations  with  the  exact  model 
(Chilwell  and  Hodgkinson  1984;  Kubica  et  al.  1990).  Figs.  2(a)-(d)  show  the 
calculated  effective  index,  loss  and  error  as  a  function  of  the  first  interference 
layer  thickness  d\  of  an  ARROW-A  structure  using  our  method  and  the  exact 
model  (Chilwell  and  Hodgkinson  1984;  Kubica  et  al.  1990).  Although  this 
model  is  based  on  antiresonant  condition,  our  computation  shows  that 
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Fig.  2.  The  calculated  effective  index  (a)  and  loss  (c)  of  TE0  mode  of  ARROW-A  structure  as  a  function 
of  the  thickness  of  the  first  antiresonant  layer,  (b)  and  (d)  are  the  errors  in  our  model  comparing  to  the 
exact  calculations. 


antiresonance  is  not  critical.  Even  when  the  thickness  d\  is  far  away  from 
antiresonant  condition,  the  current  model  matches  the  exact  calculations  very 
well  (see  Fig.  2).  For  example,  when  d}  is  around  the  resonant  point  where 
loss  is  the  highest,  the  error  between  our  method  and  the  exact  model  is  still 
smaller  than  0.2%  for  TE1  mode.  Table  1  displays  the  effective  index  and 
loss  for  several  ARROW  waveguides  including  ARROW-B  (Baba 
and  Kokubun  1992),  two  sided  ARROWs  and  ARROW  couplers  (Chen  and 
Huang  1996).  We  can  see  that  for  low  loss  ARROW  modes,  which  are  of 
main  interest  in  practical  applications,  the  approximate  model  with  real  roots 
in  identical  to  the  exact  complex  analysis. 


4.  InP/InGaAsP-AIGaAs/GaAs  ARROW  filter 

In  this  section,  we  propose  and  design  an  ARROW  filter  (Fig.  3)  which  is 
based  on  two  different  waveguides:  a  conventional  InGaAsP/InP  waveguide 
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Table  1.  Comparison  of  the  effective  index  and  loss  between  our  model  and  exact  model 


Structurea 

Characteristic  equation 

Mode 

Effective 

Loss 

Effective 

Loss 

index 

(dB/cm) 

index 

(dB/cm) 

(this  study) 

(this  study) 

(exact) 

(exact) 

One  sided 

/„  +  >21  =  0 

TE1 

1.4417085 

0.25 

1.4417085 

0.25 

ARROW-A 

TE2 

1.41798 

270 

1.4176 

407 

One  sided 

+  /:,  =  0 

TE1 

1.5382528 

0.11 

1.5382527 

0.11 

ARROW-B 

TE2 

1.5336896 

95 

1.5336856 

98 

Two  sided 

/j  ]  —  f  i2  +  ^21  —  ^22  =  0 

TE1 

1.4578523 

0.11 

1.4578523 

0.11 

ARROW  1 

TE2 

1.4518589 

76 

1.4518454 

98 

Two  sided 

i  ■+■  / 12  —  ?21  “  ?22  ”  0 

TE1 

3.1540497 

0.53 

3.1540497 

0.53 

ARROW  2 

TE2 

3.1393856 

103 

3.1393856 

113 

Two  sided 

+  / j 2  ?21  “**  ^22  ~  0 

TE1 

1.4578558 

0.06 

1.4578558 

0.06 

ARROW  3 

TE2 

1.4518726 

49 

1.4518551 

58 

ARROW 

U\  -  t2\  =  o 

Even 

3.1541037 

0.12 

3.1541037 

0.12 

Coupler 

Odd 

3.1539980 

0.14 

3.1539980 

0.14 

‘‘The  structures  (index,  thickness  (from  the  cap  to  substrate  layers)  and  wavelength)  of  calculated 
ARROWs  are  listed  below: 

1.  ARROW-A:  n  =  1/1.45/3.5/1.45/3.5:  d  =  ~/4/0.1019/2.0985/«>/pm;  /.  =  1.3  pm  (Jiang  et  al.  1989; 
Kubica  et  al  1990:  Huang  et  al.  1992). 

2.  ARROW-B:  n  =  1/1.54/1.46/1.54/3.85:  d  =  °°/4/0.3/2/°°/pm;  /.  =  0.633  pm  (Baba  and  Kokubun 
1992). 

3.  Two  sided  ARROW  1:  n  =  3.8/1.46/2.3/1.46/2.3/1.46/3.8:  d  =  ~/2/0.088/4/0.088/2/oo/pm;  a  = 
0.633  pm. 

4.  Two  sided  ARROW  2:  n  =  3.16/3.55/3.16/3.55/3.16/3.55/3.16/3.55/3.16;  d  =  -/0.237/2/0.237/4/0.237/ 
2/0.237/°®/ pm:  a  -  1.55  pm. 

5.  Two  sided  ARROW  3:  n  =  1 .46/2.3/1 .46/2.3/1 .46/2.3/1 .46/3.5;  d  =  ~/0.089/2/0.089/4/0.089/2/°°/pm; 
/.  -  0.6328  pm. 

6.  ARROW  coupler:  n/1/3. 16/3.55/3. 16/3.55/3. 16/3.55/3. 16/3.55/3. 16/3.55/3. 16/;  <//°°/2/0.237/4/0.237/2/ 
0.237/4/0.237/2/0.237/°°/:  A  =  1.55  pm  (Chen  and  Huang  1996). 


and  an  AlGaAs/GaAs  ARROW  waveguide.  Since  ARROW  has  a  very  small 
waveguide  dispersion  (Duguay  et  al.  1986;  Chu  et  al.  1996)  and  AlGaAs  has 
a  low  material  dispersion  compared  to  a  high  material  dispersion  of  InGaAsP 
around  1.55  pm,  this  structure  can  realize  a  narrowband  filter.  Another 
advantage  of  this  structure  is  that  ARROW’S  large  mode  allows  efficient 
coupling  with  fibers.  In  order  to  make  such  a  filter,  the  large  lattice  mismatch 
between  InP  and  GaAs  can  be  overcome  with  the  use  of  wafer  fusion  tech¬ 
nology  (Liu  et  al.  1999). 

To  optimize  the  design  of  this  ARROW,  first  we  use  the  approximation 
equations  in  Baba  et  al.  (1988)  to  get  the  initial  d\  (0.382  pm)  and  dj 
(2.204  pm),  then  use  an  iteration  technique  to  find  the  optimal  d\ 
(0.406  pm)  and  di  (2.21pm)  for  minimum  loss.  Since  there  is  no  root 
searching  in  the  complex  plane,  the  calculating  time  is  reduced  and  the 
calculated  value  is  the  same  as  the  rigorous  model.  For  example,  when 
7.  =  1.55  pm,  d\  -  0.406  pm,  di  =  2.21  pm,  starting  with  an  initial  effective 
index  3.24  and  an  initial  searching  step  0.00001,  our  model  takes  18  s  to 
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Fig.  3.  The  structure  of  InGaAsP/InP-AlGaAs/GaAs  ARROW  filter  and  the  profiles  of  even  and  odd 
inodes  The  detailed  structure  is  (from  left  to  right):  lnP/In0.8,Gao,„As„4P0.6  (0.403  pm)/InP 
(2  pm)/Ala4Gao.6As  (4gm)/GaAs  (d, )/Al0.4Gao,6As  (A)/GaAs  (d, )/Al„.4Ga0.6As  (<f2)/GaAs. 

find  the  root  with  <  le  -  10  error,  while  the  rigorous  model  takes  31s  on  a 
PC  (Pentium  II,  266  MHz).  The  thickness  of  InGaAsP/InP  waveguide  is 
chosen  to  be  0.403  urn  in  order  to  satisfy  the  phase  matching  around 
1.55  pm.  The  important  characteristics  of  the  filter  are  the  dispersion  curves 
of  the  two  waveguides.  Figs.  4  and  5  show  the  effective  index  and  the  loss 
of  the  separated  waveguides  and  the  even  and  odd  supermodes  of  the 
coupled  structure.  As  we  expect  from  the  design,  the  effective  indices  of 
two  separated  waveguides  are  the  same  at  the  phase  matching  point  at 
1.549  pm.  Without  coupling,  the  conventional  InGaAsP/InP  waveguide  has 
no  loss  (neglecting  the  loss  introduced  by  the  material  and  process),  and 
the  AlGaAs/GaAs  ARROW  has  a  small  loss,  which  increases  slightly  with 
the  wavelength.  When  the  two  waveguides  are  coupled,  the  losses  of  even 
and  odd  modes  of  the  coupler  are  the  same  at  the  phase  matching  con¬ 
dition.  The  field  profiles  of  even  and  odd  modes  are  also  calculated  and 
they  are  shown  in  Fig.  3.  All  these  calculations  match  very  well  the 
rigorous  model. 
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Wavelength  (pm) 

Fig.  4.  The  dispersion  for  the  modes  of  the  individual  waveguides  and  the  supermodes  of  the  coupler 
filter.  The  calculations  include  the  material  dispersion. 


5.  Conclusion 

In  this  paper,  a  set  of  simple  and  versatile  characteristic  equations  is  given 
for  different  ARROW  structures.  This  method  gives  the  precise  effective 
index,  loss  and  mode  profile  for  low  loss  ARROW  modes  of  practical  interest 
(i.e.  when  their  loss  is  smaller  than  a  few  dB/cm).  This  enables  root  searching 
in  the  real  domain.  The  physical  argument  is  simply  based  on  the  property  of 
antiresonance  and  small  loss,  which  means  that  the  reflectivity  at  the  outer¬ 
most  boundaries  is  near  1,  and  a  phase  relation  existing  for  the  field  at  the 
outermost  layers.  The  use  of  these  characteristic  equations  is  demonstrated 
for  different  ARROW  waveguides,  couples  and  filter.  This  simple  physical 
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Fig.  5.  The  ioss  of  even  and  odd  supermodes  of  InGaAsP/InP-AlGaAs/GaAs  ARROW  filters.  The  top 
thin  line  is  the  loss  of  the  separated  AlGaAs/GaAs  ARROW. 


argument  gives  an  accurate  intuitive  picture  for  loss  leaky  modes  and  it  can 
be  used  to  design  and  optimize  ARROW  devices. 


References 


Baba.  T.  and  Y.  Kokubun.  IEEE  J.  Quantum  Electron.  28  1689,  1992. 

Baba,  T.,  Y.  Kokubun,  T.  Sakakia  and  K.  Iga.  J.  Lightwave  Teclmol.  6  1440,  1988. 

Chen,  Y.H.  and  Y.T.  Huang.  J.  Lightwave  Teclmol.  14  1507,  1996. 

Chilwell,  J.  and  I.  Hodgkinson.  J .  Opt.  Soc.  Am.  A  1  742,  1984. 

Chu,  S.T..  M.  Miura  and  Y.  Kokubun.  IEEE  Photon.  Teclmol.  Lett.  8  1492.  1996. 

Duguay,  M.A..  Y.  Kokubum  and  T.L.  Koch.  Appl.  Phys.  Lett.  49  13,  1986. 

Gehler,  J.,  A.  Brauer  and  W.  Karthe.  Electron.  Lett.  30  218.  1994. 

Huang,  W.P.,  R.M.  Shubair,  A.  Nathan  and  Y.L.  Chow.  J.  Lightwave  Teclmol.  10  1015,  1992. 

Jiang,  W..  J.  Chrostowski  and  M.  Fontaine.  Opt.  Commun.  72  180,  1989. 

Kubica,  J..  D.  Uttamchandani  and  B.  Culshaw,  Opt.  Commun.  78  133.  1990. 

Liu,  B..  A.  Shakouri,  P .  Abraham,  Y.  Chiu,  S.  Zhang  and  J.E.  Bowers.  IEEE  Photon.  Technol  Lett  1193 
1999. 

Peierls,  R.  Surprises  in  theoretical  physics ,  Princeton  University  Press,  Princeton.  NJ,  1979. 

Trutschel,  U.,  F.  Ouellete,  V.  Delisle,  M.A.  Duguay,  G.  Fogarty  and  F.  Legerer.  J.  Lightwave  Technol  13 
239,  1995. 


Conclusion:  We  have  confirmed  that  the  temperature-coefficient  of 
the  average  dispersion  of  a  ZDF  transmission  line  is  100  times 
smaller  than  that  of  a  conventional  DSF.  Temperature-independ¬ 
ent  SOGbit/s  OTDM  transmission  using  the  168km  ZDF  transmis¬ 
sion  line  has  been  successfully  demonstrated.  No  penalty  was 
observed  for  the  ZDF  transmission  line  over  a  temperature  range 
of  50°C  without  adaptive  dispersion  equalisation  while  the  penalty 
for  151km  DSF  transmission  was  4.1  dB. 
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Wavelength  routing  of  40Gbit/s  packets 
with  2.5Gbit/s  header  erasure/rewriting 
using  all-fibre  wavelength  converter 

B.-E.  Olsson,  P.  Ohlen,  L.  Rau,  G.  Rossi, 

O.  Jerphagnon,  R.  Doshi,  D.S.  Humphries, 

D.J.  Blumenthal,  V.  Kaman  and  J.E.  Bowers 

40Gbit/s  packet  wavelength  routing  and  2.5Gbit/s  header 
replacement  is  demonstrated  using  an  ultra-high-speed  wavelength 
converter.  Burst-mode  bit  error  rate  measurements  are  performed 
on  both  the  header  and  payload  before  and  after  wavelength 
routing,  with  <  4dB  penalty  in  the  payload  and  <  2.5dB  penalty 
in  the  header. 

Introduction:  The  exponential  growth  in  Internet  traffic  is  forcing 
next  generation  IP  data  networks  to  a  scale  far  beyond  present 
performances.  Optical  packet  switching  technologies  may  be 
required  to  deliver  low-latency  packet  forwarding  and  routing  at 
terabit  wire  rates  and  should  support  header  erasure  and  rewrit¬ 
ing.  The  latter  is  important  for  all-optical  label  swapping  (AOLS) 
[1]  and  new  simpler  IP  routing  protocols  such  as  multi-protocol 
label  switching  (MPLS)  [2].  Initial  experiments  on  AOLS  with 
packet  rate  wavelength  conversion  [3]  and  optical  packet  switching 
[4]  have  been  reported  as  well  as  basic  routing  experiments  at 
100  Gbit/s  using  all-optical  header  processing  [5].  However,  dem¬ 
onstrations  of  wavelength  routing  with  bit  error  rate  (BER)  meas¬ 
urements  and  header  replacement  have  been  limited  to  bit  rates  at 
10  Gbit/s  or  below,  mainly  due  to  speed  limitations  in  the  wave¬ 
length  converter  technology.  In  this  Letter  we  report  the  wave¬ 
length  routing  of  packets  with  40  Gbit/s  retum-to-zero  (RZ) 
payloads  and  replacement  of  2.5  Gbit/s  time  domain  non-return  - 
to-zero  (NRZ)  headers.  Every  other  packet  is  routed  to  two  differ¬ 
ent  wavelengths  while  simultaneously  erasing  the  old  header  and 
rewriting  a  new  one  on  the  new  wavelength.  A  novel  wavelength 
converter  (WC)  [6]  plays  a  key  role  in  the  header  replacement 
process  by  wavelength  converting  the  payload  while  simultane¬ 
ously  erasing  and  rewriting  the  header.  Burst-mode  BER  measure¬ 
ments  were  performed  on  packets  for  the  payload  before  and  after 
wavelength  conversion,  and  on  the  original  and  the  replaced 
header.  The  possible  switching  speed  and  packet  length  in  this 
architecture  are  determined  by  the  tunable  laser  in  the  wavelength 
converter,  presently  "“5  ns.  The  WC  is  based  on  cross-phase  modu¬ 
lation  (XPM)  in  an  optical  fibre,  which  has  the  potential  for  oper¬ 
ating  at  ultra-high  bit  rates.  When  the  incoming  data  are 
combined  with  a  continuous  wave  (CW)  signal  and  sent  through 
an  optical  fibre,  the  data  impose  a  phase  modulation  on  the  CW 
light  through  XPM.  This  phase  modulation  generates  optical  side¬ 
bands  in  the  CW  signal,  which  can  be  converted  to  amplitude 
modulation  by  suppressing  the  original  CW  carrier  using  an  opti¬ 
cal  notch  filter.  In  this  Letter,  a  loop  mirror  filter  (LMF)  was  used 
which  consisted  of  a  short  piece  of  birefringent  fibre  in  a  Sagnac 
interferometer.  Such  a  filter  is  tunable,  polarisation  independent, 
and  has  repetitive  notches,  which  enables  conversion  to  different 
equally  spaced  wavelengths  to  be  achieved  without  any  further 
adjustment.  The  transfer  function  of  the  WC  is  nonlinear,  thus  if 
the  amplitude  of  the  input  signal  is  low,  a  very  small  portion  is 
converted  to  the  new  wavelength.  Hus  phenomenon  is  used  to 
remove  the  header  of  the  packet  since  the  header  peak  power  can 
be  substantially  lower  than  the  RZ  payload  while  still  keeping  the 
same  energy  per  bit  in  the  header  and  the  payload. 

Experimental  results :  Fig.  1  shows  the  experimental  setup.  The 
packet  generator  consisted  of  an  actively  modelocked  fibre  ring 
laser  generating  I  Ops  pulses  with  a  time-bandwidth  product  (TBP) 
of  0.45  at  1536nm  followed  by  an  LiNbOj  modulator  encoding 
lOGbit/s  PRBS  data  with  a  word  length  of  231-1.  The  lOGbit/s 
data  were  injected  into  a  passive  10  to  40  Gbit/s  multiplexer  con¬ 
sisting  of  polarisation  maintaining  (PM)  fibre  with  75  and  150ps 
delays.  An  acousto-optical  modulator  (AOM)  gated  out  a  2.5  ps 
payload  that  was  combined  with  a  2.5 Gbit/s  (PRBS  27-1)  500ns 
long  header.  The  header  was  aligned  in  front  of  the  payload  with 
a  100ns  guard  band  determined  by  the  100ns  rise  time  in  the 
AOM,  and  a  100ns  guard  band  was  inserted  between  each  packet, 
giving  a  total  packet  length  of  3.2ps.  The  packets  were  injected 
into  the  WC  which  consisted  of  an  erbium-doped  fibre  amplifier 
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(EDFA)  with  an  average  output  power  of  +  I8dBm  followed  by 
5km  dispersion-shifted  fibre  (DSF)  with  a  zero-dispersion  wave¬ 
length  of  3542nm.  A  grating  coupled  sampled  rear  reflector 
(GCSR)  laser  [7]  that  could  be  tuned  to  either  1538  or  1543nm 
within  5ns  determined  the  new  wavelength.  The  GCSR  laser  was 
also  used  to  encode  the  new  2.5Gbit/s  header  before  entering  the 
WC.  After  the  DSF,  an  LMF,  consisting  of  a  loop  mirror  with  4m 
of  PM  fibre  and  a  polarisation  controller  to  allow  adjustment  of 
the  filter  wavelength,  was  used  to  suppress  the  original  CW  light. 
The  separation  between  the  notches  was  1  nm  (given  by  the  length 
of  the  PM  fibre)  and  the  suppression  was  better  than  27dB.  A  sec¬ 
ond  filter  was  used  to  select  one  of  the  two  sidebands  and  to  sup¬ 
press  the  original  data.  The  use  of  only  one  sideband  retained  the 
pulse  width  and  TBP  from  the  input  pulse.  The  40Gbit/s  receiver 
consisted  of  a  40  to  lOGbit/s  demultiplexer  followed  by  a  lOGbit/s 
preamplified  receiver.  The  demultiplexer  was  based  on  an  electro¬ 
absorption  modulator  (EAM),  driven  with  a  30ps  electrical  pulse 
to  achieve  a  15ps  switching  window.  The  2.5Gbit/s  header  receiver 
consisted  of  an  AOM  to  remove  the  payload,  which  otherwise 
would  dominate  the  measured  optical  average  power,  followed  by 
an  optically  preamplified  receiver.  The  BER  measurements  were 
made  on  both  the  payload  and  header,  and  gated  to  select  the 
appropriate  time  interval. 


packet  transmitter  wavelength  router  with  header 


M°D:  LiNb03  modulator;  MUX:  passive  10  to  40  Gbit/s  multi¬ 
plexer;  PC:  polarisation  controller;  EAM  DEMUX:  demultiplexer 
using  electroabsorption  modulator;  BERT:  bit  error  rate  test  set 


Fig.  26  shows  every  other  packet  routed  to  1538  and  1543nm  with 
new  inserted  headers.  The  eye  patterns  of  Fig.  2c  and  d  were 
taken  zooming  into  the  payload,  before  (1536nm)  and  after  the 
WC  (1543nm),  BER  measurements  were  performed  on  both  the 
header  and  payload  on  incoming  and  outgoing  packets.  The  BER 
detector  was  gated  and  error  measurements  could  only  be  per¬ 
formed  on  -80%  of  all  bits  in  the  packets  due  to  the  number  of 
bits  required  to  synchronise  the  BER  detector.  Fig.  3a  shows  the 
BER  for  all  four  lOGbit/s  TDM  channels  in  the  40Gbit/s  payload 
for  the  input  packets  at  1536  nm  and  the  output  packets  at  1543 
and  1538nm.  All  channels  at  both  output  wavelengths  have  <  4dB 
penalty  compared  with  the  input  packets,  and  this  penalty  is 
believed  to  be  due  to  non-optimal  filtering  in  the  wavelength  con¬ 
verter  and  to  polarisation  instabilities  in  the  input  40Gbit/s  data. 


b 


Fig.  3  Bit  error  rate  measurements 
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D  V  .  e  measurements  lor  40Gbit/s  before  and  after  wave¬ 


length  routing 
B  lOGbit/sdata 
A  lOGbit/s  packet 
A  channel  1,  1536nm 
♦  channel  2,  1536nm 
□  channel  3,  1536nm 
O  channel  4,  1536nm 
A  channel  1,  1538nm 


O  channel  2,  1538nm 
▼  channel  3,  1538nm 
X  channel  4,  1538nm 
O  channel  1,  I543nm 
cv3  channel  2,  1543nm 
O  channel  3,  1543nm 
+  channel  4,  1543nm 


0)  40 Gbit/s  wavelength  conversion 

(ii)  40Gbit/s  input  packets 

(iii)  lOGbit/s  baseline 

b  Bit  error  rate  measurements  for  2.5 Gbit/s  baseline,  original  header 
and  replaced  headers 

•  baseline  C W  A  header  1 538  nm 

A  original  header  ♦  header  1543nm 


Fig.  2  Packets  and  eye-patterns  before  and  after  wavelength  routing 
a  Packets  before  wavelength  routing 
(i)  40Gbii/s  RZ  payload  and  2.5Gbit/s  NRZ  header 
(u)  headers  without  payload 
b  40 Gbit/s  eye-patterns  before  wavelength  routing 
c  Packets  routed  to  1538  and  1543mn 
d  Eye  pattern  of  routed  packet  at  1543nm 


Fig.  2a  shows  the  input  packets  consisting  of  the  40Gbit/$  RZ 
payload  and  the  2.5Gbit/s  NRZ  header  with  -lOdB  less  peak 
power.  The  lower  trace  shows  the  headers  without  the  payload. 


At  lOGbit/s  wavelength  conversion  the  penalty  was  <  ldB  which 
indicates  that  a  major  part  of  the  penalty  is  due  to  instabilities  in 
the  40Gbit/s  data.  Fig.  36  shows  BER  measurements  of  the  old 
and  replaced  headers.  The  old  header  is  completely  removed  in  the 
wavelength  converter  and  a  new  header  could  be  successfully 
inserted,  with  no  crosstalk  penalty  from  the  old  header.  However, 
a  receiver  penalty  of  2.5 dB  was  measured,  probably  due  to  distor¬ 
tion  from  the  sharp  notches  of  the  LMF.  In  a  filter  with  a  flatter 
stop  band,  e.g.  a  fibre  Bragg  grating,  this  distortion  would  proba¬ 
bly  be  avoided. 
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Characterisation  of  single  stage,  dual- 
pumped  Raman  fibre  amplifiers  for  different 
gain  fibre  lengths 

F.  Koch,  S.V.  Chemikov,  S.A.E.  Lewis  and  J.R.  Taylor 

The  power  dependent  gain  length  of  dispersion  shifted  fibre 

Raman  amplifiers  has  been  characterised.  The  noise  performance 

has  also  been  investigated. 

Broadband  silica  fibre  Raman  amplifiers  (FRAs)  are  particularly 
attractive  for  ultra-broadband  applications  in  telecommunications. 
Raman  amplifiers  can  operate  throughout  the  low  loss  window'  of 
optical  fibres  from  1.1  to  1.7  pm  compared  with  conventional 
erbium  doped  amplifiers  which  are  deployed  in  the  1.55  pm  region. 
Raman  amplification  offers  the  advantages  of  greatly  extended 
bandwidth  and  distributed  amplification  [1]  with  the  installed  fibre 
as  the  gain  medium..  Here  we  report  the  characterisation  of  the 
pump  power  requirements  and  spontaneous  noise  performance  of 
a  dual  wavelength  pumped  Raman  amplifier  for  a  wide  range  of 
gain  fibre  lengths  covering  the  range  applicable  to  discrete  or  dis¬ 
tributed  amplification. 

The  amplifier  was  constructed  from  sections  of  dispersion 
shifted  fibre  of  different  lengths  spliced  together,  with  optical  cir¬ 
culators  employed  to  multiplex  and  demultiplex  the  pump  and  sig¬ 
nal.  The  single  stage  amplifier  was  counter-pumped  [2]  using  fibre 
Raman  lasers  operating  at  fundamental  wavelengths  of  1423  and 
1455nm.  This  configuration  allows  broadband  amplification  in  the 
1.5pm  region  [3,  4].  For  each  amplifier  length,  the  pump  power 
was  adjusted  to  achieve  a  peak  gain  of  15dB.  At  lower  pump 
power  levels  the  trends  shown  in  the  gain  and  noise  figures  were 
similar  to  those  at  15  dB  which  are  exclusively  reported  in  this 
Letter. 

A  CW  diode  laser  tunable  from  1500  to  1580nm  at  a  power  of 
~20dBm  was  used  as  the  signal  source.  The  measurements  were 
carried  out  with  a  computer-automated  system  controlling  the 
tunable  laser  and  optical  spectrum  analyser  using  the  spectral  divi¬ 
sion  method.  For  each  amplifier  length  the  pump  power  was 
adjusted  to  equalise  the  gain  peaks  to  15dB.  The  pump  power  was 
calibrated  using  a  broadband  thermal  detector  with  an  accuracy  of 
6%,  with  the  error  in  this  measurement  arising  primarily  from  the 
calibration  of  the  power  meter  and  an  additional  small  systematic 
error  dependent  on  the  repeatability  of  the  splice  to  the  circulator. 
Unlike  file  erbium  amplifier  the  spontaneous  noise  figure  does  not 
depend  sensitively  on  the  pump  power. 

Fig.  1  shows  the  required  pump  powers  for  the  1423  and 
3455nm  pumps  to  achieve  a  15dB  gain  at  the  two  gain  peaks 
around  1523  and  1555nm,  respectively.  The  short  wavelength 
pump  is  operated  at  a  higher  power  due  to  the  inter-pump  Raman 


interaction  which  causes  a  progressive  power  transfer  from  the 
short  to  the  long  wavelength  pump  as  these  signals  propagate 
along  the  length  of  the  amplifier.  The  overall  required  pump 
power  decreases  with  the  amplifier  length  as  the  effective  nonlin¬ 
ear  interaction  length  grows,  while  the  differential  pump  power 
increases  as  a  result  of  the  inter-pump  interaction.  Around  an 
amplifier  length  of  ~23km  the  amplifier  starts  to  become  very 
weakly  pumped  at  the  input  and  the  pump  power  at  1423nm  has  :, 
to  be  increased  to  maintain  the  desired  gain.  In  the  case  of  an 
unsaturated  amplifier  the  maximum  gain  is  limited  by  the  direct 
loss  at  the  signal  wavelength  and  the  effective  length  given  by  the 
loss  at  the  pump  wavelength  [5].  From  the  minimum  required 
pump  power  from  Fig.  1  it  can  be  estimated  that  the  pump  power 
is  used  most  efficiently  for  the  1423  run  pump  when  the  amplifier 
length  is  less  than  ~23km.  This  length  is  slightly  longer  than  the 
effective  nonlinear  length  for  a  single  wavelength  pumped  ampli¬ 
fier  as  a  result  of  the  inter-pump  Raman  interaction. 


fig.  1  Required  pump  power  for  15  dB  maximum  gain 

— AL —  1423nra 
— # —  1455nm 


Fig.  2  Noise  figure  and  gain  for  different  amplifier  lengths 


19km 
I  19km 
k  46km 


Fig.  2  shows  the  representative  gain  and  noise  figure  spectra  for 
three  different  lengths  for  peak  gains  of  15dB.  All  the  measure¬ 
ments  include  the  loss  of  the  splices  and  circulator  coupling  losses 
which  add  to  the  noise  figure.  The  variations  in  the  gain  shape  are 
mainly  due  to  slight  variations  in  the  pump  peak  wavelength  and 
spectral  width  with  different  pump  powers.  Some  gain  narrowing 
occurs  for  longer  fibre  lengths  because  the  gross  gain  is  higher  to 
compensate  for  the  increased  loss,  but  the  main  contribution  is 
due  to  the  pump  variations.  This  has  been  confirmed  by  numerical 
simulation  [3].  The  variation  in  noise  figure  for  three  representa¬ 
tive  wavelengths  is  shown  against  amplifier  length  in  Fig.  3.  As 
expected,  the  noise  figure  increases  with  length  as  the  fibre 
becomes  less  strongly  pumped  at  the  amplifier  input  and,  for 
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A  wavelength  multiplexer  using  cascaded  three-dimensional 
vertical  couplers 
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A  four-channel  wavelength  multiplexer/demultiplexer  for  sparse  wavelength  division  multiplexing 
was  demonstrated  using  cascaded  three-dimensional  (3D)  vertical  couplers.  17  nm  channel  spacing 
with  crosstalk  less  than  15  dB  was  achieved.  Strong  coupled  vertical  couplers  were  fabricated  using 
wafer  bonding  to  invert  a  conventionally  processed  epitaxial  layer  and  bond  to  a  new  host  substrate° 
This  technology  makes  the  fabrication  of  3D  photonic  integrated  circuits  and  the  realization  of 
multilevel  optical  interconnects  possible.  ©  2000  American  Institute  of  Physics 
[S0003-695 1(00)03303-9]  ' 


Increasing  integration  density  is  a  key  factor  to  reduce 
cost  and  improve  performance  in  optoelectronic  circuits.  In 
conventional  wafer  processing,  only  one  side  of  epitaxial 
films  are  used  to  fabricate  photonic  integrated  circuits.  If 
both  sides  of  the  epitaxial  layers  can  be  processed,  three- 
dimensional  (3D)  structures  can  be  fabricated  and  the  inte¬ 
gration  density  can  be  considerably  increased.  Since  epitax¬ 
ial  layers,  by  themselves,  are  too  thin  to  handle  and  process 
directly  they  must  be  transferred  and  bonded  to  other  host 
substrates.  For  optoelectronic  device  applications,  the 
bonded  interface  must  be  both  electronically  and  optically 
transparent  to  make  multilevel  electrical  and  optical  inter¬ 
connects  possible.  The  requirement  for  optical  transparency 
means  that  the  conventional  bonding  techniques  such  as  flip- 
chip  solder  bonding  are  not  suitable  for  photonic  integration. 
Wafer  bonding,1 "  on  the  other  hand,  provides  an  interface 
with  low  electrical  and  optical  loss.3-5  In  this  letter,  we  dem¬ 
onstrate  a  wavelength  multiplexer  by  cascading  strong 
coupled  3D  vertical  couplers  using  wafer  bonding  technique 
and  double-sided  processing. 

Wavelength  multiplexers  and  demultiplexers  (MUX/ 
DEMUX)  are  the  essential  components  in  wavelength  divi¬ 
sion  multiplexing  (WDM)  networks.  Although  multiplexers 
based  on  directional  couplers  have  been  studied  for  a  long 
time,  they  are  applicable  only  to  the  case  where  the  channel 
spacing  is  quite  large  (e.g.,  980  and  1550  nm  or  1300  and 
1550  nm).6  This  is  because  of  the  weak  wavelength  depen¬ 
dence  of  the  coupling  coefficient  in  horizontally  arranged 
directional  couplers.  Otherwise  a  very  long  device  length  is 
required.  Compared  to  horizontal  couplers,  vertical  couplers 
offer  much  stronger  coupling  since  the  thickness  of  the  guid¬ 
ing  layer  and  the  space  between  two  waveguides  can  be  very 
small  and  precisely  determined  by  the  epitaxial  growth.4,7 
The  multichannel  multiplexers  with  narrow  channel  spacing 
can  be  achieved  by  cascading  several  strongly  coupled  ver¬ 
tical  couplers.  However,  separating  the  two  input  and  output 
waveguides  in  conventional  vertically  coupled  twin  wave¬ 
guide  structures  is  difficult.  This  limits  practical  applications, 
and  makes  the  cascading  of  several  vertical  couplers  impos¬ 
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sible.  This  problem  is  solved  using  double-sided  epitaxial 
layer  processing  by  wafer  bonding  in  this  letter. 

The  schematic  drawing  in  Fig.  1(a)  shows  a  four-channel 
multiplexer,  which  cascades  two  stages  of  3D  vertical  cou¬ 
plers  with  different  lengths.  As  can  be  seen  in  Fig.  1(b),  the 
two  waveguides  of  the  3D  vertical  couplers  are  coupled  ver¬ 
tically  and  separated  horizontally  in  different  planes.  This 
separation  makes  the  cascading  of  vertical  couplers  possible. 
The  operation  principle  of  a  multiplexer  by  cascading  verti- 


FIG.  1.  (a)  Schematic  drawing  of  a  four-channel  MUX/DEMUX.  Solid  lines 
are  the  top  waveguides  and  the  dashed  lines  are  the  bottom  ones,  and  (b)  3D 
vertical  coupler  with  horizontally  separated  input  and  output  waveguides. 
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FIG.  2.  The  SEM  picture  of  the  output  facet  of  a  four-channel  MUX/ 
DEMUX. 

cal  couplers  is  the  same  as  that  of  two-mode-interference 
(TMI)  based  multiplexers.8’9  The  output  intensity  of  each 
stage  is  a  periodic  function  of  the  optical  frequency.  The 
period  is  inversely  proportional  to  the  coupler  length.  Verti¬ 
cal  coupler  1  (VC1)  is  twice  as  long  as  that  of  VC2  in  the 
second  stage,  so  the  wavelength  oscillation  period  (channel 
spacing)  of  VC1  is  half  that  of  VC2.  VC3  has  nearly  the 
same  wavelength  period  as  VC2,  but  the  peaks  in  the  output 
should  be  shifted  by  a  quarter  period  to  match  the  channels. 
This  can  be  achieved  by  a  small  adjustment  to  the  length  of 
VC3.  In  our  design,  after  considering  the  coupling  in  the 
5-bend  regions  using  a  3D  beam  propagation  analysis,  the 
parameters  are  chosen:  Li  =  5mm,  —  2.397 mm,  L3 
=  2.37  mm  for  the  vertical  couplers  and  5=10^tm,  Ls 
=  500  /mm  for  5-bends. 

The  epitaxial  structure  is  grown  using  metal  organic 
chemical  vapor  deposition  (MOCVD).  It  includes  a  0.8  ^tm 
InP  frontside  ridge  layer,  a  15  nm  InGaAsP  (band  gap  1.1 
/mm)  etch  stop  layer,  a  0.1  /mm  InP  cap  layer,  a  0.5  /mm 
InGaAsP  (band  gap  1.3  /mm)  frontside  guiding  layer,  a  0.6 
/mm  InP  coupling  layer,  the  same  backside  guiding,  cap,  etch 
stop,  and  ridge  layers,  and  finally  a  0.2  /mm  InGaAs  layer 
used  to  remove  the  InP  substrate.  First,  the  3  /mm  width  front¬ 
side  ridge  waveguides  are  formed  by  reactive  ion  etching 
(RIE)  and  chemical  wet  etching.  The  frontside  guiding  layer 
above  the  backside  waveguides  in  noncoupling  areas  is  re¬ 
moved  by  another  step  of  photolithography  and  wet  etching. 
The  waveguide  sample  is  then  inverted  and  bonded  to  a  bare 
InP  host  substrate  under  pressure  for  50  min  at  630  °C  in  H2 
atmosphere.  After  removing  the  original  InP  substrate  and 
InGaAs  etch  stop  layer,  the  alignment  windows  are  opened 
by  photolithography  and  wet  etching  to  expose  alignment 
marks,  then  the  other  side  waveguides  are  fabricated  and  the 
unneeded  guiding  layers  are  removed  as  before.  Figure  2 
shows  a  scanning  electron  microscope  (SEM)  picture  of  the 
output  facet  of  a  four-channel  device.  The  device  length  is 


ISOlnm 


1524nm 

1561nm 

1543mrt 


1508nm 


Wavetength(nm) 


FIG.  4.  Four-channel  multiplexer  transmission  spectra. 


about  1.4  cm  after  cleaving  both  facets.  The  transferred  and 
bonded  thin  epitaxial  film  is  robust  and  it  is  processed  using 
the  same  techniques  as  other  planar  photonic  integrated  cir¬ 
cuits. 

To  characterize  the  device  performance,  light  from  a 
tunable  laser  was  coupled  to  an  input  waveguide  by  a  single 
mode  fiber.  The  light  at  the  output  was  collected  by  another 
single  mode  fiber  that  was  connected  to  a  detector.  The  near 
field  images  were  recorded  by  an  infrared  (IR)  camera  with  a 
20  X  lens.  Figure  3  shows  the  near  field  images  at  different 
wavelengths:  1508,  1524,  1543,  and  1561  nm,  which  are  the 
peak  response  of  the  four  channels.  The  corresponding  out¬ 
put  light  intensity  as  a  function  of  wavelength  for  the  four 
channels  is  shown  in  Fig.  4.  The  free  spectral  range  is  about 
68  nm,  as  can  be  seen  in  the  response  of  channel  2.  The 
channel  spacing  is  17  nm  for  a  four-channel  multiplexer.  The 
measured  adjacent  channel  crosstalk  ranges  from  — 13  dB  to 
—  20  dB.  This  can  be  further  improved  by  fine  tuning  the 
second  stage  of  vertical  couplers  to  overcome  fabrication  im¬ 
perfections.  The  channel  spacing  can  be  reduced  by  increas¬ 
ing  the  device  length  or  the  wavelength  dependence  of  the 
coupling  coefficient.  Figure  5  shows  the  transmission  spec¬ 
trum  of  a  2-channel  MUX/DEMUX  with  an  8  mm  interac¬ 
tion  length.  The  channel  spacing  is  about  1 1  nm. 

In  conclusion,  a  four-channel  wavelength  multiplexer 


FIG.  3.  Near  field  images  of  a  four-channel  multiplexer  at  different  wave-  FIG.  5.  The  transmission  spectrum  for  a  two-channel  MUX/DEMUX  with 
lengths.  an  8  mm  interaction  length.  , 
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using  cascaded  3D  vertical  couplers  has  been  successfully 
demonstrated.  17  run  channel  spacing  with  crosstalk  less 
than  13  dB-20  dB  is  achieved.  By  cascading  additional 
stages  of  vertical  couplers,  a  multichannel  MUX/DEMUX 
can  be  realized.  Using  wafer  bonding,  a  conventionally  pro¬ 
cessed  epitaxial  layer  can  be  inverted  and  bonded  to  a  new 
host  substrate.  The  backside  of  the  epitaxial  films  can  then  be 
processed  as  well.  This  kind  of  double-sided  processing 
makes  the  fabrication  of  3D  photonic  integrated  circuits  pos¬ 
sible  and  increases  the  integration  density. 

This  work  was  supported  by  DARPA  and  AFOSR  under 
the  Multidisciplinary  Optical  Switching  Technology 
(MOST)  center. 
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Fig.  3  Eye  diagram  of  100  Gbit/s  signal  photodiode  bandwidth  50  GHz) 

Insets:  Eye  diagrams  showing  clear  and  open  eyes  of  two  demulti¬ 
plexed  JOGbit/s  signals 


Fig.  4  Measured  BER  of  100  Gbit/s  wavelength  converted  signal  against 
received  power  at  input  to  optical  preamplifier 
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3.7 ps  pulse  generation  at  >  30 GHz  by  dual¬ 
drive  electroabsorption  modulator 

V.  Kaman,  Yi-Jen  Chiu,  S.Z.  Zhang  and  J.E.  Bowers 


Eye  diagrams  of  the  lOOGbit/s  signal  as  recorded  with  a  500 Hz 
bandwidth  photodiode  are  shown  in  Fig.  3.  There  is  not  sufficient 
resolution  to  resolve  the  lOOGbit/s  eye  diagram.  However,  the 
lOOGbit/s  signal  was  subsequently  demultiplexed  into  ten  signal 
streams  of  lOGbit/s  [9]  and  the  eye  diagrams  of  these  demulti¬ 
plexed  signals  show  clear  and  open  eyes.  The  eye  diagrams  of  the 
second  and  ninth  demultiplexed  signals  arc  shown  as  an  example 
in  the  lower  left  and  right  inset  of  Fig.  3.  The  BER  of  the  con¬ 
verted  100 Gbit/s  signal,  as  shown  in  Fig.  4,  was  measured  after 
demultiplexing  back  to  lOGbit/s  and  feeding  this  signal  to  an  opti¬ 
cally  preamplificd  pin  receiver.  Thus,  the  received  power  was 
measured  for  lOGbit/s.  The  penalty  is  due  to  format  conversion 
(~2dB),  extinction  ratio  degradation  (~0.5  dB),  the  pattern  depend¬ 
ence  of  long  words  (~3dB)  and  signal- to-noisc  ratio  degradation. 
All  ten  demultiplexed  signals  gave  a  BER  within  1  dB  around  the 
depicted  curve.  The  polarisation  sensitivity  against  the  input  signal 
was  below  2dB. 

Conclusions:  We  have  performed  the  first  lOOGbit/s  wavelength 
conversion  experiment  exploiting  cross-phase  modulation  in  an 
SOA.  A  novel  compact  and  fully  packaged  SOA  delayed-intcrfer- 
encc  wavelength  converter  was  employed  to  perform  the  experi¬ 
ments.  BER  measurements  show  that  these  arc  the  best  SOA 
based  wavelength  conversion  results  ever  obtained  at  lOOGbit/s. 

Acknowledgments:  T.L.  Koch  and  M.  Zimgibl  are  acknowledged 
for  support  of  this  work.  P.A.  Besse  is  acknowledged  for  fruitful 
discussions  with  J.  Leuthoid. 
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The  authors  describe  optical  short  pulse  generation  at  frequencies 
>  30 GHz  by  a  dual-drive  scheme  of  a  high-saturation  power 
travelling-wave  clcctroabsorption  modulator.  Sub-4  ps  almost- 
Iransform-limiied  pulses  are  achieved  with  >  20dB  dynamic 
extinction  ratio  and  low  polarisation  sensitivity. 

Introduction :  Optical  fibre  transmission  based  on  single  channel 
optical  time  division  multiplexing  (OTDM)  has  recently  attracted 
a  lot  of  attention  as  a  means  of  upgrading  future  TDM  systems 
[1,  2].  Sinusoidally  driven  electroabkirption  (EA)  modulators  play 
a  key  role  in  OTDM  systems  as  optical  short  pulse  generators  and 
optical  demultiplexers.  Owing  to  advances  in  high-speed  electrical 
TDM,  it  is  inevitable  that  next-generation  OTDM  systems  will 
operate  at  a  base  rate  of  40Gbil/s  with  optical  multiplexing  to 
l60Gbit/s  or  more  [3].  Therefore,  it  is  important  to  investigate  the 
high-frequency  switching  performance  of  EA  modulators.  Owing 
to  its  nonlinear  attenuation  characteristic,  a  highly  reverse  biased 
EA  modulator  with  an  applied  sinusoidal  RF  signal  is  capable  of 
producing  switching  windows  with  duty  ratios  as  small  as  7.2% 
[4].  However,  since  the  optical  loss  depends  strongly  on  the  inser¬ 
tion  loss  and  the  duty  ratio  of  the  pulses,  the  average  optical  out¬ 
put  power  and  consequently  the  sigual-to-noise  ratio  (SNR), 
especially  at  high  frequencies,  can  be  very'  low.  Therefore,  an  EA 
modulator  with  a  high-saturation  input  power  is  required  [5]. 
Another  limiting  factor  at  high-frequency  operation  is  the  avail¬ 
able  RF  power  as  well  as  the  response  of  the  modulator,  which 
can  result  in  broader  pulses  with  degraded  extinction  ratios  than  is 
theoretically  predicted  [6]. 

In  this  Letter,  wc  demonstrate  >  30GHz  pulse  generation  of  a 
high-saturation  power  travelling-wave  EA  modulator.  A  novel 
dual-drive  scheme  is  employed  to  effectively  double  the  RF  drive 
and  to  achieve  <  15%  duly  ratios,  which  we  believe  is  the  smallest 
duty  ratios  ever  reported  for  these  frequencies  using  a  single  EA 
modulator. 
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Experimental  setup:  The  travelling-wave  EA  modulator,  fabricated 
using  MOCVD  grown  InGaAsP-InGaAsP  quantum  wells,  is  simi¬ 
lar  to  the  device  used  in  the  previously  demonstrated  30 Gbit/s 
data  modulation  experiment  [7].  The  fibre-to-fibre  insertion  loss  at 
1555nm  was  10.8dB  while  the  maximum  extinction  ratio  was  36.4 
and  40.3d B  at  a  reverse  bias  of-6V  for  the  TE  and  TM  polarisa¬ 
tions,  respectively.  It  is  important  to  mention  that  the  optical 
input  power  was  +7.5  dBm,  which  demonstrates  the  high  satura¬ 
tion  power  characteristic  of  this  2pm  wide,  300pm  long  modula¬ 
tor.  The  3dBe  bandwidth  of  the  device  was  26GHz. 


. 

RF  source 


Fig.  1  Experimental  setup  for  dual-drive  scheme  of  EA  modulator 
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Optical  output  is  a  typical  40 GHz  waveform 
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Fig.  2  Pulsewidth  against  reverse  bias  for  30GHz  modulation 

O  single  drive 
•  dual  drive 

Upper  inset:  autocorrelation  traces  of  pulses 

-  single  drive 

- dual  drive 

Lower  inset:  optical  spectrum  for  dual-drive  modulation 

Fig.  1  shows  the  device  operation  for  the  dual-drive  scheme. 
Two  RF  signals,  synchronised  by  an  electrical  delay  line,  were  fed 
into  the  two  electrodes  of  the  modulator  while  a  single  reverse  bias 
was  applied.  Since  AC-coupled  amplifiers  effectively  terminated 
both  electrodes,  heating  effects  were  reduced  and  external  temper¬ 
ature  cooling  was  not  employed.  The  optical  output  of  the  modu¬ 
lator  was  then  amplified  by  an  optical  amplifier,  which  was 
followed  by  a  l.9nm  optical  bandpass  filter.  The  output  pulses 
were  then  measured  on  a  second -harmonic  generation  autocorrela¬ 
tor  and  an  optical  spectrum  analyser.  The  width  of  the  obtained 
pulses  was  deconvolved  <t$suming  a  Gaussian  pulse  shape  as 
inferred  from  the  optical  spectrum  measurements.  It  is  very  impor¬ 
tant  to  note  that  the  following  criteria  were  used  for  the  these 
measurements:  (i)  the  average  optical  output  power  was  higher 
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than  -25dBm  in  order  to  ensure  a  high  SNR,  and  (ii)  the  dynamic 
extinction  ratio  was  estimated  to  be  >  20dB. 


Fig.  3  Puhewidth  against  reverse  bias  for  40  GIIz  dual-drive  modulation 

Upper  inset:  autocorrelation  trace  of  pulse 
Lower  inset:  optical  spectrum 


High-frequency  operation:  Fig.  2  shows  the  obtained  pulsewidths 
as  a  function  of  reverse  bias  at  30GHz.  The  RF  amplifiers  gener¬ 
ated  7Vpp  into  a  50£2  load.  Using  only  a  single  drive  (the  other 
electrode  was  terminated  in  50H),  a  minimum  pulsewidth  of  4.7ps 
was  achieved  at  a  reverse  bias  of  -5.5V.  Under  the  dual-drive 
modulation,  an  almost-transfonn-limiled  pulsewidth  of  3.7ps  was 
obtained  with  a  low  polarisation  sensitivity  of  0.3ps.  This  result 
corresponds  to  a  duty  ratio  of  -11%  at  30  GHz.  Further  pulse 
compression  was  not  observed  when  the  EA  modulator  was  fol¬ 
lowed  by  dispersion-compensating  fibre.  The  inset  in  Fig.  2  shows 
the  autocorrelation  trace  of  the  pulses  achieved  for  single  and  dual 
modulation.  The  optical  spectrum  obtained  for  the  dual-drive 
operation  is  also  shown  as  an  inset  in  Fig.  2.  The  bandwidth  was 
0.84 nm,  resulting  in  a  time-bandwidth  product  of  0.39.  It  is 
important  to  mention  that  shorter  pulsewidths  were  obtained  at 
higher  reverse  biases  at  the  expense  of  a  degraded  SNR  and 
dynamic  extinction  ratio. 

The  EA  modulator  was  also  driven  with  40GHz  7Vpp  dual  RF 
signals.  Fig.  3  shows  the  pulsewidths  obtained  as  a  function  of 
reverse  bias.  A  minimum  pulsewidth  of  3.6ps  with  an  optical 
bandwidth  of  0.83nm  was  achieved  at  -5V  (insets  to  Fig.  3), 
which  corresponds  to  a  duty  ratio  of  14.4%.  These  results  indicate 
that  160  to  40 Gbit/s  optical  demultiplexing  should  be  feasible  with 
an  optical  power  penalty  of  <  1  dB  [8]. 

Summary:  We  have  successfully  demonstrated  optical  short  pulse 
generation  at  frequencies  >  30GIIz  using  a  dual-drive  scheme  in  a 
travelling-wave  EA  modulator.  Sub-4 ps  pulses  were  obtained  with 
low  polarisation  sensitivity  (0.3 ps),  high  dynamic  extinction  ratio 
(>  20dB),  high  optical  input  power  (+7.5dBm)  and  high  average 
optical  output  power  (>  -25 dBm). 
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MOST  program. 
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Interferometric  crosstalk  reduction  by 
phase  scrambling  in  WDM  integrated  cross¬ 
connects 

E.  Tangdiongga,  R.  Jonker,  H.  de  Waardt  and 
G.D.  Khoc 

Interferometric  crosstalk  mitigation  in  a  four-channel  2.5GbilA 

InP-based  2X2  cross-connect  using  phase  scrambling  is  reported. 

Bit  error  rate  performance  is  improved  from  a  large  power 

penalty  indicated  by  a  floor  at  10  s  to  a  penalty  of  <  IdB. 

Introduction:  A  phase  scrambling  (PS)  technique  has  been  investi¬ 
gated  as  a  means  for  mitigating  the  detrimental  effects  of  interfer¬ 
ometric  noise  in  optical  links  [l].  This  type  of  noise  may  occur  in 
integrated  wavelength-selective  devices  such  as  InP-bascd  optical 
cross-connects  (OXCs).  Owing  to  the  compact  size  of  a  few  milli¬ 
metres  and  the  switching  speed  of  a  few  nanoseconds,  the  InP- 
bascd  OXC  is  very  attractive  for  packet  switching  applications.  As 
a  disadvantage,  InP-based  OXCs  still  show  moderate  crosstalk 
levels  [2],  although  considerable  improvements  have  been  achieved 
recently  [3].  A  theoretical  study  of  PS  for  a  single-channel  point- 
to-point  transmission  has  been  published  in  [4].  In  this  Letter,  wc 
report  for  the  first  time  the  application  of  the  PS  technique  to  a 
multi-channel  2x2  InP-based  OXC  in  order  to  improve  its  per¬ 
formance.  Without  the  PS,  a  2.5Gbit/s  bitrate  transmission 
showed  poor  performance  due  to  interferometric  crosstalk  and  bit 
error  rate  (BER)  floors  occured  at  1CH\  By  using  the  PS.  error-free 
transmission  with  a  penalty  of  <  0.5dB  is  obtained.  This  result 
demonstrates  dearly  the  potential  of  the  PS  technique  in  WDM 
networks  employing  OXCs  for  which  the  crosstalk  performance 
does  not  yet  fully  comply  with  the  stringent  telecom  requirements. 

Experimtmtal  setup:  A  four-channel  integrated  InP-based  OXC 
was  placed  in  the  experimental  setup  (Fig.  I).  Four  DFB  lasers 
provided  CW  sources  at  wavelengths  of  1551.0,  1554.2,  1557.4  and 
1560.6nm.  Pseudorandom  nonreturn-lo-zero  (NRZ)  data  of  a 
sequence  length  of  23)  -  1  was  encoded  at  a  bit  rate  of  2.5 Gbit/s 
using  an  external  modulator  to  generate  optical  signals  with  nar¬ 
row  spectra.  The  four  channels  were  subsequently  scrambled  in 
phase  by  the  phase  scrambler  .section  to  broaden  their  spectra,  and 


amplified  by  an  EDFA  before  being  split  to  create  two  paths  for 
feeding  both  input  ports  of  the  OXC.  To  obtain  two  uncorrelatcd 
input  signals,  we  inserted  a  delay  fibre  in  one  arm  before  the 
input.  The  delay  fibre  was  chosen  to  be  much  longer  than  the 
coherence  length  of  each  laser  source.  Two  polarisauon  controller 
were  used  to  maximise  the  detrimental  effects  of  interferometric 
beating  noise.  The  combination  of  the  power  splitter  and  polarisa¬ 
tion  controllers  created  a  worst-case  condition  in  the  setup:  wave¬ 
length  and  polarisation  alignment.  The  experimental  results 
represent,  therefore,  the  worst-case  crosstalk  performance  that 
may  occur  in  WDM  networks.  To  couple  the  signals  into  and  out 
of  the  OXC,  we  adopted  the  same  technique  as  [5].  After  travelling 
through  the  single-phase  array  OXC,  the  channels  were  amplified 
to  compensate  for  fibre-lo-fibre  losses.  The  BER  evaluation  for 
each  channel  was  performed  by  an  optical  demultiplexer  (band¬ 
width  90GHz)  for  channel  selection  and  a  variable  attenuator 
before  the  receiver  for  input  power  adjustment.  The  receiver  con¬ 
sisted  of  an  InGaAs  pin  photodiode  followed  by  a  variable  gain 
electrical  amplifier  to  boost  the  photocurrent.  The  electrical  band¬ 
width  of  the  receiver  circuit  is  1. 8 GHz,  which  is  sufficient  to 
detect  2.5 Gbit/s  signals  without  significant  signal  distortion.  The 
phase  scrambler  section  was  realised  by  using  a  phase  modulator 
driven  by  a  noise  signal.  The  noise  signal  was  made  by  mixing  a 
200 MHz  band-limited  white  noise  source  with  an  RF  signal.  The 
obtained  noise  signal  caused  a  phase  deviation  of  the  value  n  and 
it  was  centred  at  the  RF  frequency  of  2.5GHz.  The  spectrum  of 
the  2.5  Gbit/s  signal  due  to  the  PS  is  shown  in  Fig.  2.  Compared 
to  the  original  spectrum,  there  is  a  phase  scrambler  induced  spec¬ 
tral  broadening  of  75pm  (measured  at  -20dB).  This  spectral 
broadening  will  cause  an  additional  penalty  of  <  IdB  after  200km 
standard  fibre  due  to  chromatic  dispersion  [4], 


Fig.  1  Experimental  setup 


k.  nm 


[349/2 1 

Fig.  2  Signal  spectrum 


(i)  due  to  phase  scrambling 

(ii)  original  spectrum 


Results:  The  penalties  due  to  interferometric  crosstalk  in  the  OXC 
were  measured  by  taking  input  powers  corresponding  to  a  BER 
value  of  10~9.  As  a  reference,  the  BER  of  a  scheme  without  cross¬ 
talk  (only  one  input  port  being  used)  was  used.  Measurement  of 
the  statical  transmission  properties  of  the  OXC  showed  that  the 
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was  achieved  at  room  temperature.  No  penalty  was  observed  after 
transmission  over  10  km  of  standard  singlemode  fibre. 


▲  back  to  back  at  room  temperature 
■  back  to  back  at  T  =  65°C 
•  over  4km  of  singlemode  fibre  at  T  =  65°C 


To  investigate  the  effects  of  temperature,  transmission  experi¬ 
ments  at  a  transmitter  case  temperature  of  65°C  were  performed. 
Fig.  5  shows  the  measured  bit  error  rate  performance  at  room 
temperature,  and  at  65°C  for  a  back-to-back  configuration  and 
over  4km  of  singlemode  fibre,  respectively.  The  increase  in  tem¬ 
perature  results  in  a  power  penalty  of  ~1.2dB,  while  transmission 
over  4km  of  standard  singlemode  fibre  at  650°C  yields  another 
penalty  of  ~0.5dB.  The  IdB  difference  in  sensitivity  at  room  tem¬ 
perature  compared  to  Fig.  4  is  due  to  the  use  of  a  different 
receiver  module  with  a  reduced  fibre-to-chip  coupling  efficiency-  in 
this  transmission  experiment. 

Conclusions:  We  have  presented  a  lOGbit/s  transmission  system 
using  an  uncooled  MQW  ridge  waveguide  laser  as  a  transmitter. 
We  achieved  a  sensitivity  of  -13.0dBm  at  a  transmitter  case  tem¬ 
perature  of  65°C  over  4km  of  standard  singlemode  fibre.  Using 
standard  T046  packages  for  the  transmitter  and  receiver  offers  low 
cost  potential,  which  is  essential  for  future  high-speed  datacom- 
munications  applications. 
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120Gbit/s  OTDM  system  using 
electroabsorption  transmitter  and 
demultiplexer  operating  at  30GHz 

V.  Kaman  and  J.E.  Bowers 

The  authors  describe  a  120  Gbit/s  (4  x  30Gbit/s)  OTDM  system 
in  which  the  transmitter  and  the  receiver  are  based  on 
electroabsorption  modulators  operating  at  30GHz.  Error-free 
operation  with  an  average  120Gbit/s  sensitivity  of  -22.6dBm  is 
achieved. 


Introduction:  Sinusoidally  driven  electroabsoiption  (EA)  modula¬ 
tors  have  recently  attracted  much  attention  in  high-speed  optical 
time  division  multiplexed  (OTDM)  systems  as  optical  short  pulse 
generators  and  optical  demultiplexers  [1,  2].  An  80Gbit/s  OTDM 
data  stream  (with  lOGbit/s  base  rate)  was  realised  by  short  pulses 
generated  from  EA  modulators  without  using  any  nonlinear  pulse 
compression  [1].  Conversely,  a  160  Gbit/s  optically  multiplexed 
data  stream  was  demultiplexed  to  lOGbit/s  using  two  concate¬ 
nated  EA  modulators  [2].  Owing  to  advances  in  high-speed  electri¬ 
cal  TDM,  it  is  inevitable  that  next  generation  OTDM  systems  will 
operate  at  a  base  rate  of  40Gbit/s,  with  optical  multiplexing  to 
160 Gbit/s  or  more  [3,  4],  The  increase  of  the  base  rate  and  the 
consequent  reduction  in  the  number  of  optical  channels  should 
allow  for  more  robust  and  stable  high-speed  OTDM  systems  [5]. 

Recently,  we  demonstrated  sub-4ps  pulse  generation  using  a 
single  EA  modulator  at  modulation  frequencies  >  30GHz  [6]  as 
well  as  integrated  tandem  EA  modulators  for  >  lOOGbit/s  optical 
demultiplexing  [7].  In  this  Letter,  we  demonstrate  a  120Gbit/s  (4  x 
30Gbit/s)  OTDM  system  based  on  30GHz  modulation  of  these 
devices.  We  believe  that  this  is  the  first  demonstration  of  the  feasi¬ 
bility  of  using  EA  modulators  in  conjunction  with  high-speed 
(>  20  Gbit/s)  electrical  multiplexing  and  demultiplexing  for 
>  lOOGbit/s  OTDM  systems. 


Fig.  1  Experimental  setup  for  120  Gbit/s  OTDM  system 
Eye  diagrams  -  !3.3ps/div 
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Experimental  setup  and  results:  The  experimental  setup  for  the 
120Gbit/s  OTDM  system  is  shown  in  Fig.  1.  The  base  rate  of 
30 Gbit/s  was  generated  by  electrically  multiplexing  two  7.5Gbit/s 
chaimels  with  pattern  lengths  of  231-1,  followed  by  optical  multi¬ 
plexing  to  30 Gbit/s.  The  30Gbit/s  optical  signal  of  2dBm  was  then 
coupled  into  the  dual-drive  EA  modulator  [6),  which  has  a  maxi¬ 
mum  extinction  ratio  of  40.3dB  and  a  bandwidth  of  26GHz.  The 
modulator  was  reverse  biased  at  -6V  and  driven  by  two  30GHz 
RF  amplifiers  (7Vpp)  which  were  synchronised  to  the  30Gbit/s 
data  stream  by  electrical  delay  lines.  Transform-limited  4ps  pulses 
were  generated  and  external  compression  was  not  employed  after 
the  modulator.  The  30 Gbit/s  signal  was  then  passively  multiplexed 
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(with  200  bits  of  delay  between  channels)  to  generate  the  120Gbit/s 
OTDM  data  stream.  Both  time  and  polarisation  multiplexing  were 
employed  at  the  last  stage  of  the  multiplexer  to  minimise  coherent 
interference  between  adjacent  channels.  Fig.  2  shows  the  two 
60Gbit/s  optical  channels  measured  using  a  30  GHz  photodiode. 


Fig.  2  Two  60 Gbit/s  channels  measured  with  30 GHz  photodiode 
13.3ps/div 


•  120Gbit/s 
O  60  Gbit/s 

Inset:  Receiver  sensitivity  at  10“9  for  all  16  channels 

In  the  receiver,  the  1 20 Gbit/s  data  stream  was  optically  pream¬ 
plified  and  fed  into  the  optical  demultiplexer  which  consisted  of 
integrated  tandem  EA  modulators  [7].  The  two  modulators  (EA1 
and  EA2)  are  300pm  and  400pm  long,  respectively,  and  are  sepa¬ 
rated  by  a  20pm  long  optical  waveguide  region.  H+  ion-implanta¬ 
tion  was  used  between  the  modulators  to  reduce  electrical 
crosstalk  and  capacitance.  The  insertion  loss  of  the  device  was 
14.2dB  while  the  maximum  extinction  ratio  was  ~50dB.  Both 
modulators  were  driven  by  30  GHz  RF  signals  and  a  switching 
window  of  5  to  6ps  was  achieved  over  a  wide  range  of  reverse 
biases.  The  output  in  Fig.  1  shows  the  four  demultiplexed  30  Gbit/s 
optical  channels.  A  polarisation  sensitivity  of  -2dB  was  observed 
which  had  a  minimal  impact  on  the  30  Gbit/s  eye  opening.  The 
received  30  Gbit/s  eye  diagrams  also  indicate  that  farther  high¬ 
speed  electrical  demultiplexing  is  feasible.  It  should  also  be  noted 
that  a  similar  dual-drive  EA  modulator  used  in  the  transmitter 
could  have  been  employed  as  the  optical  demultiplexer. 

To  simplify  bit  error  rate  (BER)  measurements,  EA2  was  only 
driven  by  a  7.5GHz  RF  signal  and  the  120Gbit/s  data  stream  was 
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directly  demultiplexed  to  7. 5  Gbit/s.  Fig.  3  shows  the  120Gbit/s 
and  60Gbit/s  BER  measurements  for  an  arbitrarily  selected. chan¬ 
nel.  Error-free  operation  with  a  120Gbit/s  receiver  sensitivity  of  - 
22.2dBm  was  achieved.  The  inset  to  Fig.  3  shows  the  sensitivities 
of  all  16  channels  for  a  BER  of  10~9  (mean  of  -22,6dBm)  while 
Fig.  4  shows  all  the  demultiplexed  channels. 
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Fig.  4  Eye  diagrams  for  all  16  demultiplexed  channels 
13. 3  ps/di  v 

Summary :  We  have  successfully  demonstrated  a  four-channel 
120Gbit/s  OTDM  system  that  reduces  the  number  of  optical  chan¬ 
nels  by  increasing  the  base  rate  to  30  Gbit/s.  This  is  achieved  by 
employing  electroabsorption  modulators  operating  at  30GHz  for 
puise  generation  and  optical  demultiplexing.  Further  operation  at 
a  base  rate  of  40Gbit/s  for  an  aggregate  data  rate  of  160Gbit/s 
should  be  feasible  with  these  devices. 
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8  x  40 Gbit/s  transmission  over  640km  of 
large-effective-area  nonzero-dispersion 
shined  fibre  with  an  input  power  tolerance 
greater  than  7  dB 

H.  Murai,  H.T.  Yamada,  A.R.  Pratt  and  Y.  Ozeki 

40Gbit/s  transmission  is  reported  over  640km  of  large-effective- 
area  nonzero  dispersion  shifted  fibre  for  eight  wavelength  division 
multiplexed  channels  with  a  channel  spacing  of  100GHz  and  an 
input  power  tolerance  greater  than  7dB. 

Introduction :  Reducing  nonlinear  effects  in  high-speed,  dense 
wavelength  division  multiplexing  (DWDM)  transmission  systems 
has  become  increasingly  more  important  as  bit  rates  evolve  to 
keep  pace  with  bandwidth  demand.  Whereas  the  dispersion  of  the 
transmission  fibre  can  be  compensated  for  using  standard  disper¬ 
sion  compensating  techniques  [1],  fibre  nonlinearity  increases  pro¬ 
portionally  with  bit  rate  and  represents  the  ultimate  limit  to 
system  performance.  Large-effective-area  nonzero-dispersion 
shifted  (NZDS)  fibre  has  an  increased  light-carrying  cross-section 
and  can  potentially  support  higher  input  powers,  while  at  the 
same  time  suppressing  four  wave  mixing  (FWM),  cross-phase 
modulation  (XPM)  and  pulse  distortion  due  to  self-phase  modula¬ 
tion  (SPM)  [2].  Reducing  the  fibre  nonlinearity  facilitates  linear- 
like  transmission  at  higher  bit  rates.  This  improves  the  transmis¬ 
sion  characteristics  since  the  fibre  dispersion  can  be  almost  com¬ 
pletely  compensated  using  the  appropriate  length  of  dispersion 
compensating  fibre  (DCF).  In  this  Letter  we  demonstrate  for  the 
first  time  the  feasibility  of  transmitting  a  40  Gbit/s  retum-to-zero 
(RZ)  signal  over  eight  SO  km  spans  of  large-effective-area  nonzero- 
dispersion  shifted  fibre.  Eight  multiplexed  wavelength  channels, 
with  a  channel  spacing  of  100 GHz,  have  been  successfully  trans¬ 
mitted,  error  free,  over  a  total  transmission  distance  of  640km 
with  an  input  power  tolerance  greater  than  7dB. 


broadband  narrowband 
DCF  40Gblt/s  40Hz 


Experiment :  Fig.  1  shows  the  experimental  setup.  Eight  100  GHz¬ 
spaced  wavelength  channels,  ranging  from  1547.0nm  to  1552.6nm, 
were  combined  and  modulated  using  two  high-speed  InGaAsP- 
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based  multiple  quantum  well  electroabsorption  (EA)  modulators 
[3,  4],  to  generate  a  40Gbit/s  optical  pulse  stream.  Data  encoding 
at  40  GBit/s,  with  a  215  —  1  pseudorandom  binary  sequence 
(PRJBS)  was  performed  using  the  second  EA  modulator.  Four 
lOGbit/s  signals,  each  encoded  with  215  -  1  PRBS,  were  electri¬ 
cally  multiplexed  to  generate  the  3.0Vpp  40Gbit/s  modulation  sig¬ 
nal.  The  40Gbit/s  RZ  signal  was  amplified  and  the  wavelength 
channels  decorrelated  using  0.75km  of  DCF  (-50ps/nm),  which 
also  introduced  pre-chirp  prior  to  transmission.  The  recirculating 
loop  experiment  consisted  of  two  amplifier  spans,  comprising 
80km  of  NZDS  fibre  with  an  effective  area  of  75pm2  and  an  aver¬ 
age  dispersion  of  3.8ps/nno/km  (at  1549nm).  An  additional  ampli¬ 
fier  was  included  in  each  span  to  prevent  signal-tonoise  ratio 
(SNR)  degradation  and  to  maintain  a  power  level  of  ~-5dBm 
before  launching  into  the  dispersion  compensating  fibre  (DCF). 
4km  of  DCF,  with  a  typical  average  dispersion  of  ~80ps/nm/km, 
was  inserted  at  the  end  of  each  span.  The  wavelength  correspond¬ 
ing  to  zero  average  dispersion  was  set  to  1549.1mn.  After  trans¬ 
mission,  each  wavelength  channel  was  demultiplexed  using  a 
tunable  optical  bandpass  filter  with  a  3dB  bandwidth  of  ~60GHz 
(0.5nm).  A  length  of  DCF  was  also  adjusted  outside  the  loop  to 
individually  compensate  the  accumulated  dispersion  at  each  wave¬ 
length.  The  transmission  of  the  individual  channels  was  evaluated 
by  optically  demultiplexing  the  40Gbit/s  data  stream  to  lOGbit/s 
using  two  20Gbit/s  EA  modulators.  An  electronic  phase-locked 
loop  (PLL)  clock  recovery  setup  was  used  to  drive  the  EA  modu¬ 
lators  and  act  as  a  trigger  for  the  bit  error  rate  (BER)  measure¬ 
ments. 


Fig.  2  Single-channel,  40Gbit/s  transmission 
Wavelength 

□  1549.4nm  (640km)  ■  1549.5nm  (1120km) 

A  1 552.6 nm  (640km)  A  1552.6nm  (1120km) 

O  1547.0nm  (640km)  •  I547.0nm  (1120km) 


Results  and  discussions:  Fig.  2  shows  the  BER  characteristics  for 
single-channel  40  Gbit/s  RZ  transmission  at  three  wavelengths  of 
1547.0nm,  1549.4mn  and  1552.6nm.  The  results  clearly  demon¬ 
strate  that  error-free  transmission  (corresponding  to  a  BER  <  10~9) 
was  achieved  after  a  total  transmission  distance  of  1 120km  (equiv¬ 
alent  to  fourteen  80km  spans  of  NZDS  fibre)  for  all  but  the  long¬ 
est  wavelength  channel.  Owing  to  the  relatively  large  dispersion 
slope  of  the  NZDS  fibre  GO.  1  ps/nm2/km),  the  accumulated  disper¬ 
sion  was  largest  at  1552.6nm,  and  the  transmission  at  this  wave¬ 
length  was  limited  by  nonlinear  effects.  The  launch  power  per 
channel  was  also  varied  to  determine  the  input  power  tolerance 
afforded  by  the  large-effective-area  fibre.  Gose  to  the  zero-disper¬ 
sion  wavelength,  the  eye  patterns  showed  clear  openings  over  an 
input  range  of  lOdBm,  indicating  that  nonlinear  effects  did  not 
limit  the  transmission.  At  a  transmission  distance  of  640km,  error- 
free  transmission  was  achieved  at  all  three  wavelengths  with  a  sim¬ 
ilar  input  power  tolerance  of  lOdB.  It  is  worth  noting  that  the  EA 
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In  conclusion,  in  this  Letter  a  new  learning  algorithm  for  pat¬ 
tern  classification  has  been  developed.  The  authors  have  shown 
that  the  performances  of  the  proposed  approach  are  better  than 
those  obtained  using  well  known  pattern  classification  techniques. 
It  is  worth  noting  that  the  choice  of  the  neural  model  has  been 
motivated  by  the  fact  that,  differently  from  other  neural  architec¬ 
tures.  CNNs  make  feasible  the  implementation  of  VLSI  devices 
for  real-time  vision  [6].  Therefore,  this  Letter  introduces  the  possi¬ 
bility  of  implementing  neural  devices  for  efficient  pattern  classifi¬ 
cation. 
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Compact  40Gbit/s  demultiplexing  receiver 
based  on  integrated  tandem 
electroabsorption  modulators 

V.  Kaman,  Y.-J.  Chiu,  T.  Liljeberg,  S.Z.  Zhang  and 
J.E.  Bowers 


receiver,  but  also  provides  efficient  coupling  without  the  need  for 
additional  optical  components. 

In  this  Letter,  we  demonstrate  a  compact  40 Gbit/s  demultiplex¬ 
ing  receiver  based  on  integrated  tandem  EA  modulators.  The  fea¬ 
sibility  of  this  device  for  80  to  20  Gbit/s  demultiplexing  and 
photodetection  is  also  presented. 


Fig.  1  Single-channel  lOGbit/s  performance  of  demultiplexing  receiver 
for  BER  of  10~9  as  function  of  optical  input  power 

Inset:  Schematic  diagram  of  device  operation  (EA  -  electroabsorption 
modulator) 


Device  characteristics:  The  tandem  EA  modulators  (300  and 
400pm  long)  are  based  on  a  travelling-wave  electrode  structure 
with  ten  periods  of  strain-compensated  InGaAsP  quantum  wells 
fabricated  by  metal-organic  chemical  vapour  deposition 
(MOCVD)  on  semi-insulating  InP  substrate  [7].  The  400pm-long 
device  (the  optical  demultiplexer)  achieved  a  maximum  extinction 
of  38  dB  at  -6V  while  the  300pm-long  device  was  used  as  a  reverse 
biased  photodetector  with  a  responsivity  of  0.5A/W.  The  two 
modulators  were  separated  by  a  20fim-long  optical  waveguide 
defined  by  H+  ion  implantation,  which  also  extended  50pm  into 
each  modulator  to  reduce  capacitance  and  microwave  crosstalk. 
The  measured  impedance  and  the  microwave  crosstalk  were  50kQ 
and  <  -30 dB,  respectively,  while  the  optical  insertion  loss  of  the 
ion-implanted  region  was  estimated  to  be  2.3  dB.  Both  modulators 
were  terminated  in  thin-film  resistors  and  dielectric  capacitors, 
which  reduced  heating  effects  and  allowed  for  long-term  operation 
of  the  tandem  without  any  external  temperature  cooling.  The 
400pm-long  device  had  a  modulation  bandwidth  of  16  GHz  while 
the  300pm  device  had  a  photodetection  bandwidth  of  18GHz, 
which  is  suitable  for  20Gbit/s  detection  applications. 


A  compact  40 Gbit/s  demultiplexing  receiver  based  on  integrated 
tandem  clcctroabsorplion  modulators  is  described.  Error-free 
operation  and  a  receiver  sensitivity  of -27 dBm  are  achieved,  while 
the  feasibility  of  this  device  as  an  80Gbit/s  demultiplexing  receiver 
is  reported. 

Introduction:  High-speed  demultiplexing  is  one  of  the  key  features 
in  high  bit  rate  time-division-multiplexed  transmission  systems. 
Demultiplexing  based  on  high-speed  integrated  circuit  technology 
has  been  demonstrated  at  60Gbit/s  [I];  however,  it  has  been  lim¬ 
ited  to  40 Gbit/s  in  fibre  transmission  experiments  [2,  3].  Con¬ 
versely,  optical  demultiplexing  using  a  sinusoidally-driven 
electroabsorption  (EA)  modulator  has  emerged  as  an  alternative 
approach  to  electrical  demultiplexing  [4].  with  demonstrations  at 
80 Gbit/s  [5]  and  160Gbit/s  [6].  The  major  drawback  with  this 
technique  is  the  high  insertion  (coupling)  losses  in  EA  modulators, 
which  requires  the  demultiplexed  optical  channel  to  be  externally 
amplified  before  detection  using  a  receiver.  There  are  two  conse¬ 
quences  to  this:  (i)  each  channel  to  be  demultiplexed  requires  an 
optical  amplifier  and  a  receiver,  which  increases  the  cost  and  the 
complexity  of  the  receiver  node,  and  (ii)  environmentally  induced 
timing  asynchronisalion  between  the  demultiplexer  and  the  clock- 
recovery  unit  can  lead  to  possible  signal-to-noise  ratio  (SNR)  dete¬ 
rioration.  Therefore,  it  is  desirable  to  integrate  the  demultiplexer 
and  the  photodetector,  which  not  only  allows  for  a  compact 


Fig.  2  BER  curves 

9.2 dBm  optical  input  power  to  tandem  device 

•  lOGbit/s  single  channel 

□  40  Gbit/s  data  stream 

Inset:  (i)  40 Gbit/s  optical  input  (20ps/div) 

(ii)  lOGbit/s  demultiplexed  and  received  channel  (50 ps/di v) 
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Results:  The  operation  of  the  tandem  EA  modulators  as  a  demul¬ 
tiplexing  receiver  is  shown  as  an  inset  to  Fig.  1.  The  40Gbit/s 
optical  lime-division  multiplexed  (OTDM)  data  stream  was  real¬ 
ised  by  passively  multiplexing  with  lm-Iong  fibre  dela\  lines  of 
lOGbit/s  data-encoded  (27  -  1  pattern  length)  8ps  pulses,  which 
were  generated  by  a  harmonically  driven  EA  modulator  at 
1555nm.  The  40  Gbit/s  data  stream  was  then  optically  preampli¬ 
fied  before  it  was  coupled  into  the  tandem  receiver.  The  400pm 
EA  modulator  was  reverse  biased  at  -4.8V  and  driven  by  a  6V 
10GHz  RF  signal  to  generate  a  14ps  switching  window,  which 
was  synchronised  to  the  desired  optical  channel  by  an  electrical 
delay  line.  The  demultiplexed  optical  channel  was  then  detected  by 
the  300pm  devioc  biased  at  -8V. 

Single-channel  lOGbit/s  bit  error  rate  (BER)  measurements 
were  performed  in  order  to  determine  the  photodetection  perform¬ 
ance  of  the  tandem  device.  Fig.  1  shows  the  average  received 
power  (at  the  input  of  the  optical  preamplifier)  required  for  a 
BER  of  10~9  as  a  function  of  the  optical  input  power  into  the  tan¬ 
dem.  Degradation  in  receiver  sensitivity  is  observed  for  low  input 
powers  owing  to  the  low  SNR  at  the  photodetector  while  no 
power  penalty  is  acquired  at  high  input  powers.  It  is  important  to 
mention  that  a  2dB  optical  power  penalty  was  observed  for  a  pat¬ 
tern  length  of  231  -  1,  which  was  determined  to  be  due  to  electrical 
reflections  from  the  RF  termination  of  the  300pm  photodetector 
device.  Fig.  2  shows  the  BER  measurements  at  10  and  40Gbit/s. 
Error-free  operation  was  achieved  for  40Gbit/s  operation  with  a 
receiver  sensitivity  of  -27dBm.  The  insets  to  Fig.  2  show  the 
40 Gbit/s  optical  input  data  stream  and  the  received  lOGbit/s 
channel. 


reverse  bias,  V 


3  PElse  width  as  faction  of  reverse  bias  at  20GHz  modulation  for 
400 fjm  EA  modulator 


Inset:  OsGiUcscopc  trace  of  pulses  measured  using  40GHz  photodetec¬ 
tor  (20ps/div) 


To  investigate  the  possibility  for  80Gbit/s  operation,  the  400pm 
demultiplexing  modulator  was  driven  with  a  6.4V^  20GHz  RF 
signal  and  a  CW  input  at  1555nm.  The  output  pulses  were  then 
measured  on  a  second  harmonic  generation  autocorrelator  and 
deconvolved  assuming  a  Gaussian  pulse  shape  as  inferred  from  the 
optical  spectrum  measurements.  Fig.  3  shows  the  pulse  widths 
(FWHM)  obtained  as  a  function  of  reverse  bias  while  the  dynamic 
extinction  ratio  was  estimated  to  be  >  20dB.  Pulse  widths  as  short 
as  6ps  were  achieved,  which  indicates  that  penalty-free  demulti¬ 
plexing  of  80Gbit/s  data  stream  is  possible  [5,  8]. 


Summary:  We  have  successfully  demonstrated  error-free  40Gbit/s 
operation  of  a  compact  demultiplexing  receiver  based  on  inte¬ 
grated  tandem  EA  modulators.  Our  results  indicate  that  these 
devices  can  be  used  as  efficient  receivers  at  bit  rates  up  to  80Gbitfs. 
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Demonstration  of  high  robustness  to  SNR 
impairment  in  20Gbit/s  long-haul 
transmission  using  1.5|xm  saturable 
absorber 

O.  Leclerc,  G.  Aubin,  P.  Brindel,  J.  Mangeney, 

H.  Choumane,  S.  Barre  and  J.-L.  Oudar 

Improved  robustness  to  signal-to-noise  ratio  impairment  is  shown 

for  the  first  time  at  20Gbit/s  in  an  RZ  transmission  loop  using  a 

1.5pm  heavy-ion-irradiated  quantum  well  saturable  absorber. 

Error-free  transmission  is  achieved  over  7800km. 

Introduction:  Passive  optical  regeneration  using  saturable  absorb¬ 
ers  can  contribute  to  increased  bit-rate  or  error  free  distance  in 
signal-to-noise  ratio  (SNR)-limited  transmission  systems  [1,  2].  We 
recently  showed  the  efficiency  of  a  multi-quantum  well  (MQW) 
saturable  absorber  (SA)  in  a  lOGbit/s  RZ  long-haul  transmission 
[3].  In  this  Letter  we  report  the  ability  of  such  an  ultra-fast  passive 
device  to  reduce  amplified  spontaneous  emission  (ASE)  accumula¬ 
tion  at  20Gbit/s  in  a  1.55pm  loop  experiment,  yielding  both 
increased  transmission  distances  and  improved  system  robustness. 

Saturable  absorber  characteristics:  The  vertical  cavity  structure  of 
the  MQW  semiconductor  saturable  absorber  is  that  detailed  in  [3]. 
The  device  is  used  in  reflection  mode  and  is  polarisation-insensi¬ 
tive  thanks  to  the  normal  incidence  of  the  input  light-wave.  Under 
low  repetition  rate  of  short  pulses,  it  was  shown  to  exhibit  an  on/ 
off  contrast  ratio  of  10:0  and  a  reflectivity  of  40%  with  a  satura¬ 
tion  energy  <  2pJ.  Irradiated  by  II  MeV  Ni+  ions,  with  an  irradia¬ 
tion  dose  of  1  x  10ncm-2,  it  yields  a  response  time  of  I  Ops  [4  -  6]. 

The  discrimination  properties  of  the  SA  have  been  analysed  at 
higher  frequency  using  puiposely  degraded  20 GHz  RZ  signals 
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Simultaneous  OTDM  Demultiplexing  and  Detection 
Using  an  Electroabsorption  Modulator 

Volkan  Kaman,  Student  Member,  IEEE ,  Adrian  J.  Keating,  Member,  IEEE ,  Sheng  Z.  Zhang,  Member,  IEEE ,  and 

John  E.  Bowers,  Fellow,  IEEE 


Abstract — A  traveling-wave  electroabsorption  modulator  is 
used  to  simultaneously  demultiplex  and  electrically  detect  a  single 
10-Gb/s  channel  from  a  20-Gb/s  optical  time-division  multiplexed 
data  stream  while  transmitting  the  other  channel  in  an  optically 
transparent  manner.  A  bit-error-rate  penalty  of  05  dB  and 
an  error  floor  were  observed  for  the  dropped  channel  due  to 
residual  absorption  of  the  other  channel.  Error-free  operation 
was  achieved  for  the  transparent  channel. 

Index  Terms — Demultiplexing,  electroabsorption,  optical  fiber 
communication,  optical  switches,  photodetectors,  traveling  wave 
devices. 


I.  Introduction 

LECTRO ABSORPTION  (EA)  modulators  are  suited  to 
optical  time-division-multiplexing  (OTDM)  network  ap¬ 
plications  as  their  nonlinear  transfer  function  permits  narrow  op¬ 
tical  switching  windows  with  large  extinction  ratios.  Recently, 
there  have  been  several  demonstrations  of  key  processing  fea¬ 
tures  required  in  OTDM  systems,  such  as  optical  short  pulse 
generation  [1],  optical  demultiplexing  [2],  [3],  optical  regenera¬ 
tion  [4],  [5]  and  drop  and  insert  multiplexing  [6]  using  EA  mod¬ 
ulators.  The  ability  to  drop  a  single  channel  from  a  high-speed 
OTDM  data  stream  is  significant  at  a  network  node  since  op¬ 
tical  demultiplexing  is  performed  while  still  transmitting  the 
other  channels  in  an  optically  transparent  manner  for  further 
processing.  This  has  the  advantage  of  optical  power  conserva¬ 
tion  in  contrast  to  standard  optical  demultiplexing  (for  example, 
using  an  EA  modulator)  in  which  the  other  channels  are  sup¬ 
pressed  [2].  It  is  also  desirable  to  perform  the  demultiplexing 
(or  channel  drop)  and  optical-to-electrical  (O/E)  conversion  si¬ 
multaneously  using  a  single  device  in  order  to  reduce  the  cost 
and  complexity  of  the  node.  EA  modulators  are  ideal  for  this 
application  since  they  can  be  used  as  optical  switches  and  pho¬ 
todetectors  due  to  their  absorptive  characteristic. 

In  consideration  of  the  use  of  an  EA  modulator  in  OTDM 
drop  and  detect  applications,  several  crosstalk  issues  need  to  be 
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addressed.  The  electrical  extinction  ratio  (defined  as  the  ratio 
between  the  responsivity  at  high  reverse  bias  and  zero  bias)  has 
to  be  high  enough  to  prevent  absorption  of  the  undropped  chan¬ 
nels  that  would  interfere  with  the  dropped  channel.  If  a  new 
channel  is  to  be  inserted  in  the  available  bit  slot  after  a  channel 
is  dropped,  it  is  also  important  to  have  a  high  optical  extinction 
ratio  to  prevent  interference  between  the  dropped  channel  and 
the  channel  to  be  inserted. 

To  date,  only  the  ability  of  an  EA  modulator  to  drop  a  channel 
without  detection  has  been  demonstrated  [6].  In  this  paper,  we 
demonstrate  for  the  first  time  the  use  of  an  EA  modulator  to 
simultaneously  demultiplex  and  detect  the  desired  channel  from 
an  OTDM  data  stream  while  transmitting  the  other  channel  in 
an  optically  transparent  manner. 

n.  Principle  of  Operation 

The  EA  modulator  used  for  the  OTDM  drop  and  detect 
application  is  a  2.5-^m-wide  300-/xm-long  device  with  trav¬ 
eling-wave  electrodes  as  described  in  reference  [7].  The  3-dB 
bandwidth  was  about  10  GHz  when  operated  as  a  modulator 
at  a  reverse  bias  of  —3  V  and  as  a  photodetector  at  —6  V  (the 
traveling-wave  electrode  is  terminated  in  50  Q,).  The  maximum 
static  optical  and  electrical  extinction  ratios  are  40  dB  and 
28  dB,  respectively,  while  the  fiber-to-fiber  insertion  loss  is 
10  dB. 

Fig.  1  shows  the  operation  principle  of  the  EA  modulator  for 
simultaneous  drop  and  detection.  The  traveling- wave  modulator 
is  a  four-port  device;  the  optical  input  and  output  are  shown  as 
port  1  and  3,  respectively.  The  EA  modulator  is  biased  at  a  high 
reverse  bias  and  a  10-GHz  driving  signal  is  applied  to  port  2 
using  a  high  power  amplifier.  When  the  RF  drive  signal  swings 
negative,  the  EA  modulator  is  in  the  high  absorption  state  and 
any  optical  signal  passing  through  the  modulator  is  absorbed 
and  converted  to  photocurrent.  On  the  other  hand,  when  the  RF 
signal  swings  positive,  the  EA  modulator  switches  to  high  op¬ 
tical  transparency  with  low  absorption.  Port  4  functions  as  the 
10-Gb/s  optical  pulse  detector.  The  output  at  this  port  consists  of 
photogenerated  carriers  (if  data  4T”  is  present)  as  well  as  the  at¬ 
tenuated  10-GHz  driving  RF  signal.  This  signal  is  rejected  from 
the  data  using  a  6.4-GHz  low-pass  filter  (with  60-dB  attenua¬ 
tion  at  10  GHz)  and  reflected  back  into  the  modulator  synchro¬ 
nized  with  the  original  drive  using  an  electrical  phase  shifter. 
The  low-pass  filter  also  converts  the  retum-to-zero  (RZ)  data 
into  nonretum-to-zero  (NRZ)  format  for  bit-error-rate  (BER) 
measurements.  The  alignment  of  the  optical  bits  to  the  electrical 
drive  signal  is  performed  using  an  optical  delay  line.  It  is  sig¬ 
nificant  to  mention  that  when  the  EA  modulator  operates  as  a 
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Fig.  1.  Operation  of  the  OTDM  drop  and  detect  experiment.  Ports  1  and  3  are 
the  optical  input  and  output,  respectively.  Electrical  RF  drive  is  applied  to  port 
2  while  port  4  acts  as  the  10-Gb/s  pulse  detector. 


photodetector,  the  output  impedance  of  the  high-power  RF  am¬ 
plifier  at  port  2  acts  as  the  termination.  Since  our  drive  amplifier 
had  a  low-frequency  cutoff  of  about  10  MHz,  this  limited  our 
experiment  to  a  pseudorandom  bit  stream  of  27  —  1. 


10  Gbit/s 
channel  1  out 


Fig.  2.  Set-up  for  the  20-Gb/s  OTDM  drop  and  detect  experiment 
(PC-polarization  controller). 


Fig.  3.  Eye  diagrams,  (a)  The  20-Gb/s  OTDM  data  stream  input  to  the  EA 
modulator  at  port  1 .  (b)  Optical  transparency  gating  of  the  EA  modulator  aligned 
to  channel  2.  (c)  The  10-Gb/s  channel  2  transmitted  in  an  optically  transparent 
manner  at  port  3.  (d)  Simultaneously  demultiplexed  and  O/E  convened  10-Gb/s 
channel  1  at  port  4. 


ID.  Experimental  Set-Up  and  Results 

The  experimental  setup  for  the  20-Gb/s  OTDM  simultaneous 
channel  drop  and  detect  is  shown  in  Fig.  2.  An  EA  modulator 
harmonically  driven  at  10  GHz  was  used  to  generate  11-ps 
pulses  at  1555  nm.  The  10-Gb/s  data  was  encoded  onto  the 
10-GHz  pulse  train  by  a  LiNb03  modulator.  The  10-Gb/s 
RZ  data  was  then  passively  multiplexed  (with  20  bits  of 
delay  between  channels)  to  generate  a  20-Gb/s  OTDM  data 
stream  [Fig.  3(a)].  The  20-Gb/s  signal  of  about  5  dBm  was 
coupled  into  the  drop  and  detect  EA  modulator  using  an  optical 
preamplifier  followed  by  a  0.6-nm  bandwidth  optical  filter.  A 
polarization  controller  was  used  at  the  input  of  the  modulator 
for  best  operation.  Fig.  3(b)  shows  the  optical  gating  performed 
by  the  modulator  at  a  bias  of  -5  V  with  the  10-GHz  drive 
signal  (10  Vpp)  aligned  to  channel  2  for  optical  transparency. 
Fig.  3(c)  shows  the  optical  output  at  port  3,  where  channel  2  is 
transmitted,  while  channel  1  has  been  completely  removed.  The 
optically  transparent  channel  2  was  then  amplified  before  being 
detected  at  a  commercial  10-Gb/s  receiver.  The  simultaneously 
dropped  and  detected  10-Gb/s  NRZ  converted  channel  1  is 


shown  in  Fig.  3(d).  Sufficient  eye  opening  is  achieved  with 
most  of  the  noise  due  to  the  residual  absorption  of  channel  2. 

Fig.  4  shows  the  BER  measurements  at  10  and  20  Gb/s,  where 
the  average  received  power  was  determined  at  the  input  of  the 
optical  preamplifier  to  the  EA  modulator  for  all  measurements. 
Single-channel  10-Gb/s  BER  measurements  were  performed  in 
order  to  determine  the  photodetection  performance  of  the  EA 
modulator  by  blocking  one  of  the  arms  of  the  optical  multi¬ 
plexer.  The  reference  line  is  when  port  2  was  terminated  in  50  Q 
and  port  4  was  as  illustrated  in  Fig.  1  (the  modulator  was  biased 
at  -5  V).  A  receiver  sensitivity  of  -31  dBm  for  a  BER  of  10" 9 
and  a  slight  error  floor  is  observed.  It  was  determined  that  the 
error  floor  was  due  to  the  high-frequency  limitations  of  the  EA 
modulator  at  this  bias;  at  a  bias  of  — 6  V,  error-free  operation  and 
an  improved  sensitivity  of  -34  dBm  was  achieved  (not  shown). 
Next,  the  50-fl  termination  at  port  2  was  replaced  by  the  driving 
amplifier  with  the  RF  signal  applied  to  the  EA  modulator  at  a 
bias  of  -5  V.  A  negative  penalty  of  1  dB  and  error-free  oper¬ 
ation  is  observed.  This  improvement  at  -5  V  is  simply  due  to 
the  higher  reverse  bias  obtained  from  the  negative  swing  of  the 
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Fig.  4.  BER  curves  for  10-Gb/s  (triangles)  and  20-Gb/s  (circles) 
demultiplexing.  The  filled  and  open  symbols  correspond  to  the  detected 
channel  1  and  transparent  channel  2,  respectively. 

10-GHz  signal  during  the  time  period  of  the  10  Gb/s  channel. 
However,  it  should  also  be  mentioned  that  eye  diagram  deterio¬ 
ration  was  observed  due  to  the  high  noise  figure  and  imperfect 
output  impedance  of  the  driving  amplifier. 

20-Gb/s  BER  measurements  were  then  performed  with  the 
EA  modulator  operated  as  a  simultaneous  demultiplexer  and 
O/E  converter  at  a  bias  of  —5  V.  The  optically  transparent 
channel  2  operated  error-free  with  a  20-Gb/s  sensitivity  of 
-33.5  dBm  and  a  10-Gb/s  sensitivity  (when  channel  1  was 
blocked)  of  -36.5  dBm  (not  shown).  This  result  indicates  the 
good  optical  extinction  of  the  modulator  as  verified  in  the  eye 
diagram  [Fig.  3(c)].  On  the  other  hand,  the  simultaneously 
dropped  and  detected  channel  1  exhibited  a  20-Gb/s  sensitivity 
of  -28.5  dBm  and  an  error  floor.  From  the  single-channel 
10-Gb/s  sensitivity  of  —32  dBm,  one  can  expect  a  20-Gb/s 
sensitivity  of  -29  dBm  for  the  same  channel  if  there  is  no 
penalty.  However,  our  measurements  indicate  that  there  is  a 
power  penalty  of  0.5  dB  and  an  error  floor,  which  we  attribute 
to  the  crosstalk  from  the  residual  absorption  of  channel  2.  We 
believe  that  this  penalty  can  be  reduced  with  an  EA  modulator 
with  sufficient  bandwidth  when  operated  as  a  photodetector  at 
lower  reverse  biases.  This  would  not  only  decrease  the  residual 
absorption,  but  also  allow  for  a  longer  optical  transparency 


window  and  a  shorter  detection  window,  which  would  be  ideal 
for  simultaneous  demultiplexing  and  detection  of  a  single 
channel  from  a  four-channel  40-Gb/s  OTDM  data  stream. 

IV.  Conclusion 

In  summary,  we  have  demonstrated  that  a  single  com¬ 
ponent,  the  traveling-wave  EA  modulator,  can  be  used  to 
simultaneously  demultiplex  and  detect  a  single  channel  from 
a  bit-interleaved  data  stream,  leaving  the  other  channel  unaf¬ 
fected.  An  error  floor  was  incurred  for  the  dropped  channel 
due  to  residual  absorption  of  the  error-free  transparent  channel. 
We  believe  that  this  technique  has  the  potential  to  realize  a 
compact,  high-speed  full  demultiplexing  receiver  (optical-serial 
to  electrical-parallel  conversion)  by  integrating  a  series  of  EA 
modulators  on  a  single  chip. 
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Abstract — Integrated  tandem  traveling-wave  electroabsorption 
modulators  are  demonstrated  as  high-speed  optical  short  pulse 
generators  and  demultiplexers  for  >100  Gbit/s  optical  time-di- 
vision-multiplexed  systems.  The  tandem  significantly  increases 
the  extinction  ratio  and  further  compresses  the  optical  pulses  in 
comparison  to  a  single  modulator.  An  extinction  ratio  of  ~50  dB 
is  achieved  while  optical  pulses  of  4-6  ps  width  at  30-40  GHz  are 
generated. 

Index  Terms  Demultiplexing,  electroabsorption,  optical  fiber 
communication,  optical  switches,  traveling  wave  devices. 


I.  Introduction 

OPTICAL  fiber  transmission  based  on  single  channel  op¬ 
tical  time-division  multiplexing  (OTDM)  has  recently  at¬ 
tracted  a  lot  of  attention  as  a  means  of  upgrading  future  TDM 
systems  [l]-[3].  Due  to  advances  in  high-speed  electrical  TDM, 
it  is  inevitable  that  next  generation  OTDM  systems  will  operate 
at  a  base  rate  of  40  Gbit/s  with  optical  multiplexing  to  1 60  Gbit/s 
or  more  [4].  For  high-speed  OTDM  systems,  sinusoidally  driven 
electroabsorption  (EA)  modulators  have  become  key  devices  as 
optical  short  pulse  generators  and  optical  demultiplexers.  An  80 
Gbit/s  OTDM  data  stream  (with  10  Gbit/s  base  rate)  was  re¬ 
alized  by  short  pulses  generated  from  EA  modulators  without 
using  any  nonlinear  pulse  compression,  which  is  the  highest  ag¬ 
gregate  data  rate  achieved  using  this  technique  to  date  [5].  On 
the  other  hand,  a  160  Gbit/s  optically  multiplexed  data  stream 
was  demultiplexed  to  10  Gbit/s  using  only  EA  modulators  [1]. 

Single  EA  modulators  are  usually  limited  to  ~20  dB  dy¬ 
namic  extinction  ratio,  which  is  sufficient  for  demultiplexing 
purposes,  but  can  lead  to  incoherent  interference  between  mul¬ 
tiplexed  adjacent  pulses  in  OTDM  transmitters  [2].  Therefore, 
a  fiber-coupled  pair  of  separate  modulators  was  used  for  pulse 
generation  in  [5]  and  for  demultiplexing  in  [1].  This  configura¬ 
tion  not  only  effectively  doubles  the  dynamic  extinction  ratio, 
but  also  reduces  the  switching  window.  However,  it  is  desirable 
to  integrate  the  tandem  on  a  single  chip  in  order  to  eliminate  the 
external  optical  amplifier,  which  compensates  for  the  coupling 
losses  between  the  modulators  [6],  [7].  This  results  in  a  com- 
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optical  output 


Fig.  1.  Photograph  of  the  integrated  tandem  traveling- wave  EA  modulators 
(dashed  areas  are  the  ion-implanted  regions). 

pact  and  cost-effective  transmitter  (or  demultiplexer)  as  well  as 
an  environmentally  robust  module. 

In  this  letter,  we  investigate  the  optical  short  pulse  gen¬ 
eration  and  demultiplexing  capability  of  integrated  tandem 
traveling-wave  EA  modulators  at  repetition  frequencies  of 
30  GHz  and  40  GHz  for  >100  Gbit/s  OTDM  systems.  This 
is  also  the  first  demonstration  of  optical  pulse  generation 
using  traveling- wave  EA  modulators,  which  were  previously 
demonstrated  in  a  30  Gbit/s  data  modulation  experiment  [8]. 

II.  Device  Characteristics 

The  EA  modulators  used  for  the  OTDM  application  were 
based  on  a  traveling-wave  electrode  structure  fabricated  with 
MOCVD  grown  ten  periods  of  strain-compensated  InGaAsP 
quantum  wells  on  semi-insulating  InP  substrate  [9].  Trav¬ 
eling-wave  EA  modulators  have  the  advantage  of  overcoming 
the  RC  limitation  (in  comparison  to  lumped  EA  modulators)  re¬ 
sulting  in  longer  devices  with  higher  bandwidths  and  increased 
extinction  ratios.  The  2-/xm  wide,  300-^m  and  400-//m  long  EA 
modulators  were  cleaved  as  a  tandem  (Fig.  1).  The  20-/mi  long 
optical  waveguide  between  the  two  modulators  was  defined  by 
H+  ion  implantation  and  the  measured  impedance  was  50  kfi. 
The  ion  implantation  also  extended  50  /itn  into  each  modulator 
in  order  to  reduce  capacitance  and  microwave  crosstalk  (<—30 
dB);  however,  the  absorption  region  for  each  modulator  was 
shortened  by  100  pm.  Both  modulators  were  terminated  in 
a  thin-film  resistor  and  a  dielectric  capacitor,  which  reduced 
heating  effects  and  allowed  for  long-term  operation  of  the 
tandem  without  any  external  temperature  cooling. 

Fig.  2  shows  the  transmission  characteristics  of  the  tandem 
as  a  function  of  reverse  bias.  An  optical  input  power  of  7  dBm 
was  applied  at  1555  nm.  The  insertion  loss  of  the  tandem  was 
14.1  dB  and  15.3  dB  for  the  TE  and  TM  polarizations,  respec¬ 
tively.  Each  device  was  individually  characterized  by  keeping 
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Reverse  Bias  (V) 


Fig.  2.  Fiber-to-fiber  transmission  versus  reverse  bias  voltage  for  the 
integrated  tandem  EA  modulators.  Solid:  TE  polarization.  Dashed:  TM 
polarization. 

the  other  modulator  at  zero  bias.  The  400-/xm  device  achieved 
a  maximum  extinction  of  38  dB  at  —6  V  while  26  dB  of  extinc¬ 
tion  was  observed  for  the  300-^m  device.  The  difference  in  the 
maximum  extinction  ratios  is  due  to  the  shorter  absorption  re¬ 
gion  of  the  300-^m  device.  It  should  also  be  noted  that  at  high 
reverse  biases,  a  saturation  of  absorption  due  to  the  quantum 
well  excitonic  peak  is  observed  for  both  devices.  Even  though  it 
is  desirable  to  apply  a  high  reverse  bias  in  order  to  generate  short 
switching  windows  using  sinusoidal  modulation,  the  absorption 
saturation  will  deteriorate  the  extinction  ratio  and  generate  sig¬ 
nificant  wings  [10].  These  wings  are  detrimental  for  OTDM 
applications  since  the  resulting  incoherent  interference  in  the 
transmitter  and  the  crosstalk  in  the  receiver  will  significantly  de¬ 
grade  system  performance.  On  the  other  hand,  the  tandem  con¬ 
figuration  shows  an  improved  extinction  ratio  of  ~50  dB  while 
the  absorption  saturation  is  well  suppressed  in  comparison  to 
single  device  operation.  A  14-dB  reduction  in  the  total  expected 
extinction  is  observed,  which  is  attributed  to  higher-order  mode 
coupling. 

in.  Experimental  Results  and  Discussion 

The  optical  switching  capability  of  the  tandem  was  first  char¬ 
acterized  at  30  GHz.  Both  modulators  were  driven  with  7VPP  si¬ 
nusoidal  RF  signals,  which  were  synchronized  by  an  electrical 
delay  line.  The  widths  of  the  optical  pulses  were  measured  using 
a  second  harmonic  generation  autocorrelator  and  deconvolved 
assuming  a  gaussian  pulse  shape  as  inferred  from  the  optical 
spectrum  measurements.  It  is  important  to  mention  that  the  fol¬ 
lowing  criteria  were  used  for  these  measurements:  1)  the  av¬ 
erage  optical  output  power  was  higher  than  —25  dBm  in  order 
to  ensure  a  high  signal-to-noise  ratio  (SNR),  and  2)  the  dynamic 
extinction  ratio  was  estimated  to  be  >20  dB.  The  pulsewidths 
obtained  from  the  individual  devices  by  keeping  the  other  de¬ 
vice  at  zero  bias  are  shown  in  Fig.  3.  At  a  reverse  bias  of  -4.5 
V,  a  minimum  pulsewidth  of  6.5  ps  and  5.6  ps  were  obtained  for 
the  300-  and  400-^m  devices,  respectively.  Even  though  shorter 
pulses  were  achieved  at  higher  reverse  biases,  degradation  in  the 
dynamic  extinction  ratio  was  observed  due  to  the  absorption  sat¬ 
uration  as  discussed  in  the  preceding  section. 


Reverse  Bias  (V) 


Fig.  3.  Pulsewidth  as  a  function  of  reverse  bias  at  30  GHz  modulation  for  the 
individual  devices.  Circle  symbols:  300-// m  device;  Square  symbols:  400-// m 
device.  Inset:  Autocorrelation  trace  of  the  5.6  ps  pulse  generated  by  the  400-// m 
device  at  a  reverse  bias  of  —4.5  V. 


Reverse  Bias  (V) 

Fig.  4.  Pulsewidth  as  a  function  of  400-// m  reverse  bias  and  for  several 
300-// m  biases  (top-to-bottom)  at  30  GHz  modulation.  Closed  symbols: 
TE  polarization  (top-to-bottom:  -3.5  V,  -4  V,  -4.5  V);  Open  symbols: 
TM  polarization  (top-to-bottom:  -3  V,  -3.5  V,  -4  V).  Left  inset:  Optical 
spectrum;  Right  inset:  Autocorrelation  trace  of  the  4.6  ps  pulse. 

Fig.  4  shows  the  obtained  pulsewidths  as  a  function  of  reverse 
biases  for  the  tandem  configuration.  A  minimum  pulsewidth  of 
4.6  ps  (inset  to  Fig.  4)  with  a  fiber-coupled  output  power  of 
-24.2  dBm  was  achieved  while  an  average  of  5-6  ps  pulses 
were  observed  over  a  wide  range  of  reverse  biases  and  polar¬ 
ization  states.  This  switching  window  is  well  suited  for  >100 
Gbit/s  optical  demultiplexing  applications  [2].  The  inset  to  Fig. 
4  also  shows  the  optical  spectrum  of  the  modulated  tandem 
of  EA  modulators,  which  has  a  gaussian  shape  of  0.75  nm. 
The  time-bandwidth  product  of  0.43  suggests  that  the  pulses 
were  slightly  chirped.  When  the  tandem  was  followed  by  disper¬ 
sion-compensating  fiber  (DCF)  with  a  dispersion  of  about  —6 
ps/nm,  the  pulses  were  linearly  compressed  to  a  transform-lim¬ 
ited  pulsewidth  of  4.2  ps  (Fig.  5).  This  pulsewidth  suggests  that 
the  tandem  is  suitable  as  an  optical  pulse  source  for  simulta¬ 
neous  polarization-  and  TDM  systems  in  excess  of  100  Gbit/s. 

The  optical  switching  response  of  the  tandem  EA  modulators 
was  also  performed  at  40  GHz  with  RF  drives  of  7VPP.  The  fre¬ 
quency  response  of  the  devices  was  estimated  to  be  ~4  dB  lower 
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3  3.5  4  4.5  5 

Reverse  Bias  V^V) 

Fig.  5.  Pulsewidth  as  a  function  of  400-//m  reverse  bias  and  for  several 
300-// m  biases  (top-to-bottom)  at  30  GHz  modulation  (tandem  device 
followed  by  dispersion-compensating  fiber).  Closed  symbols:  TE  polarization 
(top-to-bottom:  —3.5  V,  —4  V,  —4.5  V);  Open  symbols:  TM  polarization 
(top-to-bottom:  -3  V,  -3.5  V,  -4  V).  Inset:  Oscilloscope  trace  of  the  30  GHz 
pulses  (13.3  ps/div). 


deal  pulsewidth  of  5.2  ps  with  an  optical  bandwidth  of  0.59  nm 
was  achieved  (Fig.  5).  These  results  should  improve  when  the 
tandem  is  driven  with  higher  power  RF  amplifiers  at  40  GHz  and 
enable  low-penalty  optical  demultiplexing  of  160  Gbit/s  OTDM 
data  stream  to  40  Gbit/s  [2], 

IV.  Conclusion 

In  summaiy,  we  have  successfully  demonstrated  integrated 
tandem  traveling-wave  EA  modulators  for  >  100  Gbit/s  OTDM 
applications.  Optical  pulses  of  4  to  6  ps  width  were  obtained 
with  high  extinction  ratio,  high  optical  input  power,  and  high 
average  optical  output  power.  These  devices  are  a  viable  tech¬ 
nology  for  optical  demultiplexing  of  bit  rates  up  to  160  Gbit/s. 
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Reverse  Bias  V^V) 


Fig.  6.  Pulsewidth  as  a  function  of  400-// m  reverse  bias  and  for  several 
300-// m  biases  (top-to-bottom)  at  40  GHz  modulation.  Closed  symbols' 
TE  polarization  (top-to-bottom:  -3  V,  -3.5  V,  -4  V);  Open  symbols-  TM 
polarization  (top-to-bottom:  -3  V,  -3.5  V).  Inset:  Autocorrelation  trace  of  the 
5.2  ps  pulse. 

at  40  GHz  (in  comparison  to  the  30  GHz  response),  which  re¬ 
sulted  in  a  compromise  between  pulsewidth,  dynamic  extinc¬ 
tion  ratio  and  average  optical  output  power.  A  minimum  op- 
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Double-Bonded  InP-InGaAsP  Vertical  Coupler 

1 : 8  Beam  Splitter 

Maura  Rabum,  Bin  Liu,  Patrick  Abraham,  and  John  E.  Bowers 


Abstract — A  novel  three-layer  double-bonded  InP-InGaAsP 
waveguide  vertical  coupler  1 : 8  beam  splitter  is  demonstrated. 
The  strongly  coupled  waveguides  allow7  a  583-pm  device  length, 
more  than  100  times  shorter  than  that  of  the  equivalent  horizontal 
coupler.  The  device  illustrates  the  use  of  multiple  vertical-layer 
optical  interconnects  for  three-dimensional  routing  of  optical 
signals. 

Index  Terms — Beam  splitting,  optical  directional  couplers,  semi¬ 
conductor  waveguides,  wafer  bonding,  waveguide  couplers. 

I.  Introduction 

HE  PHOTONIC  integrated  circuit  (IC)  industry  has  yet  to 
witness  the  rapid  developments  made  in  the  electronic  IC 
industry  for  several  reasons,  one  of  which  is  the  confinement  of 
photonic  circuits  to  two  dimensions.  The  complexity  of  two-di¬ 
mensional  photonic  ICs  is  limited  by  the  substrate  size  and  the 
difficulty  in  connecting  large  numbers  of  input  and  output  fibers 
or  electrical  connections.  By  making  the  leap  to  multilayer  inter¬ 
connects,  more  compact  devices  can  be  obtained.  Fewer  connec¬ 
tions  between  chips  are  required  and  some  devices  can  be  made 
smaller  than  their  in-plane  counterparts.  The  need  to  combine 
different  materials  on  a  single  chip  can  be  addressed  as  well. 
Three-dimensional  routing  of  signals  will  thus  be  necessary  for 
significantly  more  compact  and  powerful  photonic  ICs.  How¬ 
ever,  there  have  been  relatively  few  optical  devices  in  the  litera¬ 
ture  that  have  been  developed  specifically  for  the  realization  of 
three-dimensional  photonic  ICs. 

Vertical  directional  couplers  allow  for  very  compact  and  fast 
three-dimensional  optical  switching  devices,  with  coupling 
lengths  less  than  40  pm  [1],  [2].  Unfortunately,  traditional 
vertical  couplers  have  the  drawbacks  that  the  inputs  and  outputs 
are  difficult  to  separate,  and  the  materials  are  restricted  by 
growth  parameters.  Wafer  bonding  overcomes  these  limitations 
by  allowing  separated  input  and  output  waveguides  and  the 
joining  of  materials  of  different  lattice  constants  and  crystal¬ 
lographic  orientations.  Using  bonded  vertical  couplers,  a  large 
number  of  photonic  IC  planar  layers  of  various  compositions 
are  possible,  and  with  the  right  fabrication  conditions,  the 
current  six  to  seven  layers  for  typical  electronic  ICs  may 
eventually  be  surpassed  [3] — [5].  We  describe  in  this  letter  the 
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use  of  three-layer  photonic  interconnects  to  make  a  compact 
semiconductor  1 : 8  beam  splitter. 

II.  Device  Fabrication  and  Structure 

Cascaded  3-dB  vertical  couplers  incorporating  three  layers 
(two  InP-to-InP  bondings)  were  fabricated.  Three  wafers 
were  grown  using  metal-organic  chemical  vapor  deposition 
(MOCVD).  Due  to  the  inversion  resulting  from  the  bonding, 
the  order  in  which  the  wafers  were  processed  is  the  opposite  of 
the  top-to-bottom  order  of  the  layers  of  the  resulting  bonded 
device.  For  the  first  (bottom  layer)  wafer,  on  a  (001)  InP 
substrate,  a  0.5 -pm  InGaAsP  ( \g  =  1.3  pm)  guiding  layer, 
followed  by  a  0.1 -pm  InP  cladding  layer  were  grown.  All 
layers  were  undoped.  The  second  (middle  layer)  wafer  was 
grown  on  a  (001)  InP  substrate  as  well.  It  consisted  of  a  0.2-pm 
InGaAs  etch  stop  layer,  followed  by  a  0.1 -pm  InP  cladding 
layer,  a  0.5-pm  InGaAsP  (A^  =  1.3  pm)  guiding  layer,  and  a 
0.5-pm  InP  support  layer.  For  the  third  (top  layer)  wafer,  on 
another  (001)  InP  substrate,  a  0.2-pm  InGaAs  etch  stop  layer, 
followed  by  a  0.9-pm  InP  cladding  layer,  a  0.5-pm  InGaAsP 
(A^  =  1.3  pm)  guiding  layer,  and  a  0.5-pm  InP  support  layer 
were  grown. 

The  device  fabrication  begins  with  the  cleaving  of  the  first 
wafer  into  a  1.3  x  1.5-cm  sample,  deposition  of  SiN,  and  stan¬ 
dard  photolithography  with  the  corresponding  removal  of  SiN  to 
define  the  position  of  the  bottom-layer  waveguides.  The  waveg¬ 
uides  were  then  etched  using  CH4-H2-Ar  reactive  ion  etching, 
the  SiN  was  removed,  and  the  sample  was  bonded  at  630  °C 
in  a  hydrogen  atmosphere  for  50  minutes  to  a  blank  sample  of 
the  second  wafer  cleaved  to  the  same  size.  The  substrate  of  the 
second  wafer  was  removed  in  HC1  and  the  InGaAs  etch  stop 
layer  was  removed  in  H2SO4-H2O2.  After  substrate  removal, 
the  processing  for  the  second  and  third  layers  is  identical  to 
that  of  the  first  except  that  alignment  marks  were  uncovered 
using  infrared  photolithography  and  wet  etching  with  HC1  and 
H2SO4-H2O2  after  the  SiN  deposition.  Lastly,  the  sample  is 
cleaved  into  devices  approximately  2  mm  long  including  input 
and  output  waveguides  to  allow  flexibility  in  the  position  of  the 
cleave. 

The  waveguides  were  3  and  4  pm  wide.  Support  regions  were 
placed  10  pm  away  from  all  waveguides  to  ensure  that  the  struc¬ 
ture  did  not  deform.  Unwanted  coupling  from  the  waveguides  to 
the  InGaAsP-InP  support  regions  on  adjacent  layers  will  take 
place  if  the  waveguides  on  one  layer  are  allowed  to  be  close  to 
the  InGaAsP  of  the  other  layer.  The  InP  cladding  region  and  In¬ 
GaAsP  guiding  regions  were  hence  removed  from  within  10  pm 
of  waveguides  on  adjacent  layers  to  prevent  unwanted  coupling. 
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Fig.  1 .  SEM  of  output  waveguides.  From  left  to  right,  top,  middle,  and  bottom 
waveguides  are  shown.  Bonded  region  below  top  waveguide  is  for  support. 


Fig.  1  shows  a  SEM  of  the  cleaved  side  of  the  device.  To  pre¬ 
vent  a  coupling  asymmetry  due  to  the  lack  of  a  0.5-/zm  InP 
support  layer  above  it,  the  top  layer  waveguide  (1.4  /zm  high) 
was  made  taller  than  the  bottom  and  middle  layer  waveguides 
(0.6  /zm  high). 


in.  Device  Design 
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input  ^ 


The  principle  of  operation  of  the  1:8  beam  splitter  is  one 
of  two-mode  interference.  That  is,  the  coupling  of  light  is  due 
to  the  overlap  of  the  evanescent  fields  of  the  two  waveguides 
such  that  after  a  given  length,  light  entering  one  waveguide 
will  couple  half  of  its  power  to  the  other  waveguide  if  the 
propagation  constants  of  both  individual  waveguides  are 
similar.  Using  the  three-dimensional  finite-difference  beam 
propagation  method  (BPM  [6]),  the  performance  was  simulated 
for  various  materials,  coupling  lengths,  s-bend  lengths,  and 
waveguide  heights.  1.3-/zm  InGaAsP  (n  —  3.37)  was  chosen  as 
the  waveguide  layer  rather  than  InGaAsP  of  a  larger  index  of  re¬ 
fraction  because  lower  index  waveguide  layers  result  in  shorter 
coupling  lengths;  this  can  be  explained  in  a  coupled-mode 
picture  as  an  increase  in  the  overlap  integral  of  the  two  modes 
of  adjacent  waveguides  [7].  The  choices  of  waveguide  height 
and  separation  were  also  influenced  by  the  high  index  of  the 
waveguide  layer.  For  the  structure  described  earlier,  thinner 
waveguide  layers,  by  providing  less  modal  confinement,  will 
also  increase  the  overlap  integral  between  modes  of  adjacent 
waveguides  and  will  thus  reduce  the  coupling  length  [7]; 
less  separation  between  waveguides  will  also  provide  shorter 
couplers.  With  the  material  and  waveguide  heights  used  above, 
the  3-dB  coupling  lengths  for  the  three  stages  of  the  1 : 8  splitter 
are  37  /zm,  39  /zm,  and  47  //m,  respectively.  The  total  device 
length,  not  including  the  straight  input  and  output  waveguides 
appended  for  ease  of  cleaving,  is  583  /zm. 

By  contrast,  a  beam  splitter  made  from  horizontal  couplers 
using  the  same  material  and  waveguide  heights  and  widths 
as  in  layer  1,  and  with  1  /zm  separation  between  waveguides 
would  have  a  3-dB  coupling  length  of  4.73  cm  for  a  single 
stage  consisting  of  straight  waveguides  only.  Using  s-bends 
long  enough  to  allow  the  same  output  waveguide  spacing  as  the 
bonded  splitter,  the  total  device  length  would  be  approximately 
14  cm.  Although  a  smaller  waveguide  separation  would  allow 
a  shorter  coupling  length,  the  difficulty  in  making  reproducible 
and  uniform  narrow-gap  « 1  /zm)  horizontal  couplers  have  hin¬ 
dered  their  development  for  ultra-short  splitting  and  switching 
devices  [7],  Thus,  vertical  couplers  offer  a  great  advantage  in 
terms  of  device  compactness. 


Fig.  2.  Device  output  at  A  =  1483  nm  as  captured  by  IR  camera,  and 
corresponding  waveguide  layout. 
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Fig.  3.  Peaks  of  line  scan  of  device  output  at  1483  and  1550  nm  for  best 
polarization  and  at  1483  nm  for  worst  polarization. 

IV.  Results 

The  device  output  at  A  =  1483  nm  as  captured  by  an  in¬ 
frared  (IR)  camera  and  the  corresponding  waveguide  positions 
are  shown  in  Fig.  2.  Light  from  a  tunable  semiconductor  laser  is 
coupled  to  the  waveguides  via  a  single-mode  lensed  fiber.  The 
output  beams  have  slight  height  differences  corresponding  to 
the  waveguide  heights.  They  are  separated  by  15  /zm  for  ease  of 
coupling  to  the  output  fiber. 

To  measure  the  polarization  and  wavelength  sensitivity  of  the 
device,  the  output  of  the  device  was  coupled  to  a  single-mode 
lensed  fiber  connected  to  a  detector.  The  output  fiber  was 
mounted  on  a  computer-controlled  adjustable  xyz  stage,  and 
was  scanned  across  the  output  waveguides  at  different  polar¬ 
izations  and  wavelengths.  As  shown  in  Fig.  3,  although  the 
splitter  can  operate  at  different  wavelengths  and  polarizations, 
the  ratios  of  light  intensity  coupled  to  the  various  output 
waveguides  changes  somewhat.  The  unevenness  in  splitting  is 
thought  to  arise  from  the  less-than-ideal  alignment  obtained 
with  a  standard  mask  aligner,  as  the  alignment  tolerance  was 
simulated  to  be  approximately  0.25  /zm.  It  is  believed  that  more 
uniform  splitting  could  be  obtained  through  the  use  of  a  stepper 
mask  aligner.  The  wavelength  dependence  could  be  lessened  by 
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Fig.  4.  Proposed  demultiplexer  with  structure  similar  to  that  of  beam  splitter 
except  for  the  coupler  lengths. 

designing  even  shorter  couplers,  for  which  the  s-bends  provide 
all  of  the  coupling.  The  polarization  dependence  is  a  function  of 
the  structure  and  material  parameters  as  well;  the  device  could 
be  made  polarization-independent  through  a  modification  of 
waveguide  design  [8]. 

A  Fabry-Perot  resonance  technique  was  used  to  measure  the 
waveguide  optical  loss  [9].  Straight  waveguide  regions  were 
cleaved  off  of  the  sample  and  tested  to  give  1  dB  of  loss  per 
583-/im  1 : 8  splitter  length.  Total  s-bend  losses  were  calculated 
using  BPM  [6]  to  be  0.9  dB. 

Future  considerations  include  an  optimization  of  the  wave¬ 
guide  s-bends  through  use  of  the  conformal  transformation 
method  [10].  The  default  s-bend  shape  provided  by  the  BPM 
simulation  program  [6]  was  composed  of  two  circular  arcs. 
This  design  does  not  provide  a  minimum  s-bend  loss;  a  further 
reduction  could  be  obtained  through  a  conformal  transforma¬ 
tion  analysis. 


Another  direction  to  be  investigated  is  the  transformation 
of  the  three-layer  beam  splitter  into  a  demultiplexer  through 
a  change  to  the  lengths  of  the  coupling  regions.  This  would 
be  achieved  by  utilizing  the  wavelength  dependence  of  the 
coupling.  Couplers  on  each  successive  stage  can  be  chosen  to 
have  lengths  such  that  every  other  channel  is  split  to  a  different 
output  waveguide,  as  shown  in  Fig.  4. 


V.  Conclusion 

A  novel  three-layer  bonded  vertical  coupler  1 : 8  beam  splitter 
is  demonstrated.  To  our  knowledge,  this  is  the  first  three-layer 
three-dimensional  waveguide  beam  splitter.  A  device  length  of 
583  fzm,  two  orders  of  magnitude  smaller  than  the  length  of 
an  equivalent  horizontal  coupler  splitter,  is  obtained.  This  illus¬ 
trates  the  powerful  potential  of  the  use  of  wafer  bonding  to  fab¬ 
ricate  three-dimensional  photonic  integrated  circuits. 
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Abstract— A  vertically  coupled  InP-InGaAsP  crossed  wave¬ 
guide  optical  add-drop  multiplexer  has  been  realized  through 
the  use  of  wafer  bonding.  Designed  for  signals  in  the  1550-nm 
range,  this  novel  device  requires  only  a  single  epitaxial  growth 
and  illustrates  the  use  of  vertical  optical  interconnects  for  the 
three-dimensional  routing  of  optical  signals.  To  our  knowledge,  it 
is  also  one  of  the  first  optical  vertically  coupled  devices  with  no 
horizontally  coupled  counterpart. 

Index  Terms — Optical  directional  couplers,  optical  filters,  semi¬ 
conductor  waveguides,  wafer  bonding,  waveguide  couplers,  wave¬ 
length-division  multiplexing  (WDM). 


I.  Introduction 

OPTICAL  add-drop  multiplexers  (OADMs)  are  key 
channel  routing  components  in  wavelength-divi¬ 
sion-multiplexed  (WDM)  systems.  OADM  implementation 
methods  include  fiber  or  polymer  gratings  with  circula¬ 
tors  [1],  arrayed  waveguides  [2],  and  cascaded  unbalanced 
Mach-Zehnder  structures  [3].  InP-InGaAsP  OADMs  [4]  are 
of  particular  interest  because  monolithic  integration  of  other 
optoelectronic  devices  onto  the  same  chip  is  possible.  Vertical 
coupling  enables  the  creation  of  simpler,  shorter  devices  than 
equivalent  horizontally  coupled  structures,  and  also  allows 
laterally  separated  inputs  and  outputs  for  ease  of  coupling  to 
fibers  when  wafer  bonding  is  employed  [5]. 

In  this  letter,  a  simple  multichannel  OADM  is  proposed  and 
demonstrated.  The  device  consists  of  straight  waveguides  that 
cross  to  form  Xs.  One  “in/through”  top  waveguide  is  vertically 
coupled  to,  and  crossed  at  an  angle  of  0.3°  with,  four  parallel 
add-drop”  bottom  waveguides  (Fig.  1).  Only  one  growth  was 
required  for  the  structure  because  waveguides  were  etched  on 
both  surfaces  of  the  epitaxial  material.  The  OADM  operation 
is  based  on  coherent  coupling  of  light  between  the  top  wave¬ 
guide  and  a  bottom  waveguide,  which  will  only  take  place  when 
their  effective  refractive  indices  are  nearly  identical.  The  dif¬ 
ferent  widths  of  the  four  add-drop  waveguides  thus  correspond 
to  different  channels,  as  the  effective  indices  are  functions  of 
both  wavelength  and  waveguide  width. 
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Fig.  1.  Layout  of  the  OADM. 

II.  Device  Fabrication  and  Structure 

Vertically  coupled  waveguides  that  cross  to  form  long, 
narrow  Xs  were  fabricated.  One  wafer  was  grown  using 
metal-organic  chemical  vapor  deposition  (MOCVD).  On 
a  (001)  InP  substrate,  a  0.2-/un  InGaAs  etch  stop,  0.8-yum 
InP  layer,  15-nm  InGaAsP  etch  stop,  0.2-//m  InP  cladding 
layer,  l-/xm  InGaAsP  (A5  =  1.066  //m)  guiding  layer, 
1.5-^m  InP  cladding  and  support  layer,  0.22-/zm  InGaAsP 
(A^  =  1.359  ^m)  guiding  layer,  0.4-/zm  InP  cladding  layer, 
15-nm  InGaAsP  etch  stop,  and  0.8-//m  InP  layer  were  grown. 

The  bottom  waveguides  were  fabricated  first  because  the 
bonding  and  substrate  removal  steps  in  effect  reverse  the 
epitaxial  growth  order.  The  device  fabrication  began  with  the 
deposition  of  SiN,  standard  photolithography,  and  CF4  plasma 
etching  to  form  a  SiN  etch  mask.  Wet  etching  of  the  top  0.8-^m 
InP  layer  (HC1:H20=2:1)  defined  the  position  of  the  bottom 
layer  waveguides.  Another  set  of  SiN,  photolithography,  and 
wet  etching  steps  (HC1:H20=2:1,  H2SO4:H2O2:H2O=l:l:10) 
removed  the  etch  stop  layer,  the  0.4-^m  InP  layer  and  the 
0.22-^m  bottom  guiding  layer  in  certain  regions  (Fig.  1). 
These  layers  will  induce  unwanted  coupling  of  light  from 
the  top  waveguide  to  the  slab  mode  of  the  bottom  waveguide 
guiding  layer  if  allowed  to  remain.  After  thorough  cleaning, 
the  sample  was  bonded  at  630  °C  in  a  nitrogen  atmosphere 
for  50  min  to  a  new,  blank  InP  wafer  cleaved  to  the  same  size. 
The  substrate  and  the  0.2-^um  etch  stop  layer  of  the  grown 
wafer  were  removed  via  wet  etching.  The  backside  of  the  new 
InP  wafer  was  then  polished  to  aide  in  the  subsequent  infrared 
(IR)  photolithography,  which  defined  the  top  layer  waveguides 
while  alignment  with  the  bottom-level  pattern  was  maintained. 
Processing  of  the  top  waveguides  was  the  same  as  that  for 
the  bottom  except  CH4-H2-Ar  reactive  ion  etching  was  used 
instead  of  wet  etching.  Lastly,  the  sample  was  cleaved. 


1041-1 135/01$10.00©  2001  IEEE 


580 


IEEE  PHOTONICS  TECHNOLOGY  LETTERS,  VOL.  13,  NO.  6,  JUNE  2001 


1527nm 

1517nm 

1498nm 

1482nm 


Fig.  2.  SEM  of  sample  resulting  from  cleaving  of  output  facet,  including  Fig.  3.  Output  of  OADM  as  captured  with  IR  camera  and  80 X  microscope 
in/through  waveguide  and  four  add-drop  waveguides.  objective  lens. 


EL  Device  Design 

The  finished  OADMs  were  60  /zm  wide  and  1.2  cm  long. 
Each  device  has  one  top  in/through  waveguide  of  width  3  /zm 
and  four  parallel  bottom  add-drop  waveguides  of  width  2,  3, 
4,  and  5  /zn a,  separated  laterally  by  10  /zm  for  coupling  into 
lensed  fibers  (Fig.  2).  Since  growth  variations  cause  changes  in 
the  fraction  of  light  coupled,  crossing  angles  between  the  top 
waveguide  and  the  four  bottom  waveguides  ranging  from  0.2° 
to  0.45°  were  used.  The  x-crossing  arrangement  allowed  by  the 
bonding  also  provides  lateral  separation  of  the  top  waveguide 
from  the  four  bottom  waveguides.  Support  regions  were  placed 
10  /zm  away  from  all  waveguides  to  ensure  that  neither  defor¬ 
mation  of  the  structure  nor  unwanted  coupling  to  the  supports 
would  occur. 

If  the  difference  between  the  propagation  constants  of  two 
crossed  waveguides  is  very  small,  after  a  given  length,  light  en¬ 
tering  one  waveguide  will  couple  completely  to  the  other  wave¬ 
guide.  Dissimilar  indexes  and  dimensions  for  the  top  and  bottom 
waveguides  were  chosen  because  the  difference  in  material  and 
waveguide  dispersion  allows  complete  coupling  over  a  much 
narrower  wavelength  range  than  with  identical  coupled  waveg¬ 
uides.  Away  from  the  center  wavelength,  the  coupling  will  be 
very  small  due  to  the  phase  mismatch.  The  x-crossing  geometry 
reduces  coherent  coupling  of  sidelobes  as  well  [4]. 

The  effective  index  and  transfer  matrix  methods  [6],  [7]  were 
used  to  calculate  the  effective  refractive  indexes  at  different 
wavelengths,  material  compositions,  and  waveguide  widths. 
The  material  compositions  were  altered  slightly  from  the 
previous  single  channel  OADM  design  [4]  because  it  was 
found  that  [8]  while  the  formerly  used  Henry  et  al  approach 
[9]  gave  a  satisfactory  relationship  between  bandgap  and 
index  of  refraction  of  InGaAsP  for  smaller  bandgaps  (1.3-1.55 
/zm),  the  expression  derived  by  Weber  [10]  provided  a  better 
approximation  for  higher  bandgaps  (~1.1  /zm). 

This  four-channel  OADM  does  not  have  a  horizontally  cou¬ 
pled  equivalent,  as  the  waveguides  cannot  be  rearranged  or  bent 
in  such  a  manner  that  they  yield  an  in-plane  device  with  the  same 
capabilities  without  excessive  losses  or  crosstalk.  The  dissimi¬ 
larity  between  the  top  and  bottom  waveguides  further  compli¬ 
cates  any  approach  to  reduce  the  number  of  dimensions.  Thus, 
this  device  illustrates  the  wonderful  flexibility  in  layout  afforded 
by  wafer  bonding  to  form  vertically  coupled  structures. 
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Fig.  4.  Smoothed  (wavelength  averaged)  wavelength  scan  of  OADM. 


IV.  Results 

A  tunable  semiconductor  laser  with  a  polarization  con¬ 
troller  was  used  to  investigate  the  behavior  of  the  devices. 
The  bandgaps  of  the  InGaAsP  regions  of  the  actual  growth 
(A g  =  1.066  /zm ,  Xg  =  1.359  /zm)  deviated  slightly  from 
that  of  the  desired  growth  (A^  =  1.068  /zm,  Xg  =  1.370  /zm) 
needed  for  room  temperature  operation.  This  offset  necessitated 
the  heating  of  the  sample  to  104.3  °C  to  shift  the  indices  of  the 
material  to  allow  coupling  within  the  range  of  the  tunable  laser. 
The  temperature-induced  shift  in  the  center  wavelengths  of  the 
drop  channels  was  0.39  nm/°C. 

The  center  wavelength  of  the  device  has  a  strong  dependence 
on  the  InGaAsP  bandgaps  due  to  the  large  dispersion  difference 
between  the  top  and  bottom  waveguides.  Increasing  the  bandgap 
wavelength  of  the  top  waveguide  InGaAsP  by  1  nm  decreases 
the  center  wavelength  of  the  device  by  4. 1  nm;  the  ratio  of  wave¬ 
length  shifts  for  the  bottom  InGaAsP  is  2.5:1.  The  growth  vari¬ 
ation  of  InGaAsP  bandgaps  was  10  nm. 

The  near-field  output  is  shown  in  Fig.  3  at  the  peak  wave¬ 
length  of  each  channel.  The  four  channels  lie  at  1482,  1498, 
1517,  and  1527  nm.  Fig.  4  shows  a  wavelength  scan  of  the 
OADM  throughput  and  the  first  three  channels.  Unfortunately, 
the  fourth  channel  was  damaged  before  the  data  for  Fig.  4  was 
taken.  The  high  temperature  required  for  operation  of  the  device 
aggravated  air  drafts  around  the  lensed  fibers,  resulting  in  noise 
despite  attempts  to  shield  the  setup;  smoothed  data  is  hence 
plotted. 
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The  fiber-to-fiber  device  loss  was  11.3  dB.  The  in/through 
waveguide  loss  was  7.7  dB/cm  using  a  Fabrv-Perot  resonance 
technique  [11]. 

V.  Conclusion 

A  novel  InP-InGaAsP  vertical  coupler  OADM  is  demon¬ 
strated.  Though  the  layout  is  simple,  the  device  cannot  be  re¬ 
duced  to  two  dimensions.  This  illustrates  the  powerful  potential 
of  wafer  bonding  to  fabricate  three-dimensional  photonic  inte¬ 
grated  circuits. 


Fig.  5.  Theoretical  OADM  bandwidth  as  determined  by  crossing  angle, 
thickness  of  InP  between  the  two  InGaAsP  guiding  layers. 

The  four  bottom  waveguide  widths  were  chosen  such  that 
the  four  channels  would  be  close  in  wavelength  yet  suffer  low 
crosstalk.  Though  due  to  the  undesirable  growth,  this  was  not  the 
case  for  channels  1  and  2,  the  crosstalk  between  channels  2  and  3 
is  less  than  - 17  dB.  Adjusting  the  waveguide  widths  for  20-nm 
channel  spacing  would  give  a  maximum  crosstalk  of  - 15  dB  for 
all  channels.  This  wide  channel  spacing  indicates  suitability  for 
datacom  rather  than  telecom  operation.  The  bandwidth  can  be 
reduced  by  decreasing  the  crossing  angle,  decreasing  the  thick¬ 
ness  of  InP  between  the  InGaAsP  guiding  layers  (Fig.  5),  or 
using  guiding  materials  with  more  different  dispersion  charac¬ 
teristics  (e.g.,  InP-InGaAsP  bonded  to  AlGaAs-GaAs). 

82%  of  channel  1, 93%  of  channel  2,  and  65%  of  channel  3  are 
dropped.  Simulations  predict  that  the  maximum  intensities  of  all 
channels  can  be  made  equal  by  using  slightly  different  crossing 
angles  between  the  in/through  waveguide  and  each  add-drop 
waveguide.  The  percentage  of  power  dropped  can  be  increased 
through  fine-tuning  the  crossing  angles  as  well. 

Another  property  matching  theoretical  predictions  is  stronger 
coupling  for  TE  polarizations.  TE  and  TM  coupling  wave¬ 
lengths  were  found  to  differ  by  200  nm.  TM-oriented  light 
resulted  in  a  drop  channel  intensity  1 1  dB  less  than  that  for 
TE  light.  The  polarization  sensitivity  results  from  the  strong 
waveguide  geometry  asymmetry;  large  material  dispersion 
differences  rather  than  waveguide  dispersion  differences  (e.g., 
InP-InGaAsP  bonded  to  AlGaAs-GaAs)  would  allow  TE  and 
TM  coupling  at  the  same  wavelength.  An  OADM  with  wave¬ 
guide  parameters  adjusted  for  TM  coupling  can  be  cascaded 
after  the  OADM  described  here  for  polarization-independent 
coupling  as  well. 
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Dispersion-Shifted  Optical  Fiber 

Peter  Ohlen,  Bengt-Erik  Olsson,  and  Daniel  J.  Blumenthal,  Member,  IEEE 


Abstract — A  40-Gb/s  wavelength  converter  based  on  cross-phase 
modulation  in  a  dispersion-shifted  fiber  has  been  investigated 
through  pulse  measurements  and  simulation.  The  most  important 
parameter  is  the  dispersive  walkoff,  which  makes  the  required 
input  power  wavelength  dependent  and  determines  the  optical 
bandwidth.  Simulations  show  the  feasability  of  the  160-Gb/s 
operation  by  using  2-ps  pulses  and  a  highly  nonlinear  disper¬ 
sion-shifted  fiber. 

Index  Terms — Cross-phase  modulation,  nonlinear  fiber  optics, 
optical  fiber  communication,  optical  networks,  wavelength  con¬ 
verters. 


I.  Introduction 

ALL-OPTICAL  wavelength  conversion  can  play  an  impor¬ 
tant  role  in  future  ultrahigh-speed  networks  using  wave¬ 
length-division  multiplexing  (WDM).  The  wavelength  channels 
will  probably  be  switched  by  optical  add/drop  multiplexers,  op¬ 
tical  crossconnects,  and  possibly  optical  packet  switching  tech¬ 
niques.  Wavelength  converters  will  greatly  increase  the  flexi¬ 
bility  in  such  networks  and  can  be  used  as  a  basic  switching  el¬ 
ement  in,  e.g.,  an  optical  crossconnect  or  an  optical  packet  router 
[1]. 

Ultrahigh-speed  wavelength  conversion  of  retum-to-zero 
(RZ)  data  has  previously  been  demonstrated  using  four-wave 
mixing  (FWM)  in  fiber  [2]  and  semiconductor  optical  ampli¬ 
fiers  [3]  and  by  use  of  cross-phase  modulation  (XPM)  in  the 
nonlinear  optical  loop  mirror  [4].  Schemes  using  XPM  and 
soliton  formation  [5]  or  XPM  and  polarization  discrimination 
[6]  have  previously  been  proposed.  We  have  recently  demon¬ 
strated  a  different  XPM-based  scheme  to  wavelength  convert 
40-Gb/s  RZ  data  [7]. 

The  basic  idea  of  this  scheme  is  to  utilize  XPM  in  a  dispersion 
shifted  fiber.  Continuous-wave  (CW)  light  is  launched  into  the 
fiber  along  with  the  data  pulses.  The  pulses  will  impose  a  phase 
modulation  that  generates  sidebands  on  the  CW  light.  After  sup¬ 
pression  of  the  original  CW  wavelength,  and  filtering  out  one 
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Fig.  1.  Experimental  setup  of  the  XPM  wavelength  converter.  Er-FRL: 
fiber  ring  laser,  TECL:  tunable  external  cavity  laser,  MOD:  Mach-Zehnder 
modulator;  EDFA:  erbium-doped  fiber  amplifier;  PC:  polarization  controller; 
PMF:  polarization  maintaining  fiber,  DSF:  dispersion  shifted  fiber. 

of  the  generated  sidebands,  the  wavelength-converted  data  re¬ 
main  at  the  output.  Here  we  report  on  pulse  measurements  in 
the  wavelength  converter  in  order  to  assess  its  fundamental  op¬ 
erating  characteristics  at  40  Gb/s,  as  well  as  simulations  using 
a  highly  nonlinear  fiber  and  shorter  pulses  aiming  at  160  Gb/s. 
For  a  wavelength  converter,  both  the  maximum  data  rate  as  well 
as  the  conversion  wavelength  span  are  important.  When  RZ  data 
are  used,  it  is  important  that  the  output  pulsewidth  is  retained  as 
well  as  having  a  stable  operating  point  of  the  wavelength  con¬ 
verter.  When  the  wavelength  changes,  the  optimum  operating 
point  will  change,  and  the  input  power  would  have  to  be  con¬ 
trolled  to  compensate  for  this.  Together,  these  two  effects  will 
determine  the  optical  bandwidth  and  the  maximum  bit  rate  of 
the  wavelength  converter. 

n.  Experiment 

Fig.  1  shows  the  experimental  setup  where  8-ps  pulses  with 
a  repetition  rate  of  10  GHz  were  generated  from  an  actively 
mode-locked  erbium-doped  fiber  ring  laser,  suitable  for  40-Gb/s 
data.  The  pulses  were  gated  with  a  Mach-Zehnder  modulator 
to  increase  the  available  optical  peak  power,  combined  with 
CW  light  from  a  tunable  external-cavity  laser,  and  sent  through 
5  km  of  dispersion-shifted  fiber  with  a  zero-dispersion  wave¬ 
length  of  1542  nm.  The  pulse  power  was  monitored  at  one  port 
of  the  combiner.  After  the  fiber,  a  loop-mirror  filter  (LMF)  was 
used  to  notch  out  the  original  CW  light,  and  a  0.4-nm  band¬ 
pass  filter  was  used  to  select  one  of  the  two  generated  side¬ 
bands.  The  loop-mirror  filter  has  a  sinusoidal  filter  function  with 
repetitive  notches,  separated  by  1  nm,  and  27-dB  suppression  in 
the  notches.  An  optically  preamplified  receiver  with  a  0.6-nm 
noise-suppression  filter  and  a  sampling  oscilloscope  was  used 
to  measure  the  pulse  characteristics. 

In  the  first  experiment,  pulses  at  1535  nm  were  converted 
to  longer  wavelengths,  and  the  CW  light  was  varied  from 
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Fig.  2.  Measured  transfer  functions  for  the  wavelength  converter  using  8-ps 
input  pulses  and  5-km  dispersion-shifted  fiber. 

1538  to  1558  nm.  Depending  on  which  sideband  is  chosen, 
the  behavior  is  somewhat  different.  For  the  left  sideband,  the 
output  pulsewidth  of  12  ps  is  essentially  transform  limited  over 
this  wavelength  range  and  given  by  the  optical  filter  bandwidth. 
The  right  sideband  shows  the  same  behavior  up  to  1550  nm 
where  the  pulsewidth  starts  to  increase,  due  to  asymmetric 
spectral  broadening  due  to  SPM.  In  terms  of  optical  power, 
the  measured  input-output  transfer  function  has  a  nonlinear 
shape  shown  in  Fig.  2  for  three  different  wavelengths.  When 
the  wavelength  separation  between  the  pump  and  the  CW 
light  increases,  the  input  pulse  power  required  for  maximum 
output  power  increases  due  to  dispersive  walkoff  between 
die  input  pulses  and  the  CW  light.  The  input  pulse  power 
is  then  distributed  over  a  time  slice  of  the  CW  light,  which 
corresponds  to  this  walkoff.  The  maximum  of  the  transfer 
function  is  chosen  as  the  operating  point.  It  should  be  noted 
that  the  wavelength  converter  can  be  used  with  input  powers 
below  the  maximum  of  the  transfer  function,  but  this  is  a 
good  reference  point  for  the  measurements  and  would  also  to 
some  extent  absorb  variations  in  the  input  power.  As  shown 
in  Fig.  3(a),  the  required  input  power  for  maximum  output 
power  increases  when  the  wavelength  is  changed  from  1538 
nm  to  the  zero-dispersion  wavelength,  where  it  decreases 
until  the  walkoff  is  zero  around  1549  nm.  At  this  point,  the 
required  power  starts  increasing  very  rapidly,  due  to  higher 
dispersion.  This  behavior  can  be  explained  from  simple  theory. 

If  pulse  dispersion  and  loss  are  neglected,  the  frequency  shift 
of  the  CW  light  for  Gaussian  pulses  can  be  described  by 
Af  ~  pm£[exp(— (t  -  ft)2)  -  exp (~t2)]/0,  where  Pin  is  the 
input  power  and  L  the  fiber  length.  The  walkoff  time  ft  and  the 
time  t  are  normalized  to  the  1/e  half-width  of  the  input  pulse.  In 
this  equation,  an  increased  ft  requires  an  increased  power  to  get 
the  same  frequency  shift.  This  increase  in  power  for  constant 
frequency  shift  is  shown  together  with  the  measured  data  in 
Fig.  3(a)  after  fitting  the  power  required  for  zero  walkoff.  In  a 
second  experiment,  we  show  conversion  of  pulses  at  1553  nm 
to  shorter  wavelengths.  Fig.  3(b)  shows  that  the  required  input 
power  has  a  similar  shape  as  for  upconversion.  However,  this 
turns  out  to  be  more  difficult  because  the  pump  is  located  in  the 
anomalous  dispersion  regime,  where  the  spectral  broadening 
is  much  more  significant  than  in  the  normal  dispersion  regime. 


Fig.  3.  Required  input  power  to  reach  the  maximum  of  the  transfer  function 
Input  pulses  at  (a)  1535  and  (b)  1553  nm. 

and  higher  order  solitons  will  be  generated.  When  the  separa¬ 
tion  between  the  two  wavelengths  gets  smaller,  the  spectrally 
broadened  input  pulses  will  interfere  with  the  CW  light,  which 
generates  interferometric  noise  in  the  output  pulses.  This  effect 
starts  to  be  significant  at  1537  nm  for  input  pulses  at  1535  nm 
and  at  1547  nm  for  input  pulses  at  1553  nm. 

in.  Simulation 

The  wavelength  converter  used  in  the  experiment  was 
simulated  using  the  nonlinear  Schrodinger  equation  and  8-ps 
unchirped  Gaussian  pulses.  For  the  5-km  normal  dispersion 
shifted  fiber  (DSF),  the  zero-dispersion  wavelength  was  1542 
nm,  the  dispersion  slope  was  0.086  ps/nm2/km,  the  nonlinear 
refractive  index  n,  =  2.8  •  10-20m2/W,  the  effective  area 
60  /rm~,  and  the  loss  0.4  dB/km.  The  simulated  pulsewidths 
for  the  left  sideband  vary  between  12  and  13.5  ps  for  wave¬ 
lengths  between  1538  and  1555  nm,  and  the  right  sideband 
has  similar  pulsewidths  until  1550  nm  where  the  pulsewidth 
starts  increasing,  which  is  the  same  behavior  as  seen  in  the 
experiment.  The  required  input  power  to  reach  the  maximum 
of  the  transmission  function  is  shown  in  Fig.  4  and  has  a  good 
agreement  to  the  measurements  in  Fig.  3.  By  using  a  highly 
nonlinear  fiber  [10],  it  is  possible  to  decrease  the  required 
pulse  power,  or  to  use  a  shorter  fiber,  where  the  dispersive 
walkoff  would  be  less  significant.  Fig.  4  also  shows  results 
for  simulations  with  a  350-m-long  highly  nonlinear  fiber  with 
no  =  5.2  •  10-20m2/W  and  an  effective  area  of  10.7  /u,m2, 
with  the  same  loss  and  dispersion  characteristics  as  the  5-km 
DSF.  It  can  be  seen  that  the  difference  between  the  left  and 
right  sideband  is  negligible  and  that  the  required  input  power 
only  changes  slightly,  due  to  less  dispersive  walkoff.  For  this 
case,  the  simulated  pulsewidths  change  only  slightly  between 
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Fig.  4.  Simulation  of  the  wavelength  converter  using  8-ps  input  pulses  at  1535 
nm  with  5 -km  normal  DSF  and  350-m  highly  nonlinear  DSF. 


Fig.  5.  Simulation  of  the  transfer  function  for  2-ps  input  pulses  at  1535  nm 
using  a  highly  nonlinear  DSF,  aiming  at  160  Gb/s. 

12.1  and  12.5  ps  for  both  sidebands.  It  would  also  be  possible 
to  use  a  5-km-long  highly  nonlinear  DSF,  leading  to  the  same 
behavior  as  the  normal  DSF,  only  that  the  required  optical 
power  would  be  decreased  by  a  factor  of  ten. 

Due  to  the  fast  response  time  of  the  fiber  nonlinearity,  it 
should  be  possible  to  use  this  wavelength  converter  at  even 
higher  speeds.  To  show  this,  a  wavelength  converter  with  350-m 
highly  nonlinear  DSF  was  simulated  with  2-ps  input  pulses, 
aiming  at  160  Gb/s.  The  spacing  of  the  LMF  notches  was  5  nm, 
and  the  bandpass  filter  has  a  3-dB  bandwidth  of  1.8  nm.  Fig.  5 
shows  the  transfer  function  for  four  different  wavelengths  for 
2-ps  input  pulses,  using  the  left  sideband.  The  rise  in  output 
power  for  the  1545-nm  transfer  function  at  700-mW  input  peak 
power  is  due  to  the  spectral  broadening  of  the  pump,  which 
causes  interferometric  crosstalk  on  the  new  wavelength.  Fig.  6 
shows  the  required  input  peak  power  to  reach  the  maximum  of 
the  transmission  function  with  350-  and  35-m  highly  nonlinear 
DSF.  The  general  behavior  is  similar  to  the  40-Gb/s  case  with 
standard  DSF,  where  the  required  input  power  for  the  right 
sideband  starts  increasing  before  the  left  sideband  for  the  long 
fiber.  Also,  the  pulsewidth  of  the  right  sideband  increases  after 
1550  nm,  whereas  the  left  sideband  shows  pulsewidths  between 
2.2  and  2.4  ps  in  the  wavelength  range  1544-1560  nm.  For  the 
35-m  fiber,  the  wavelength  dependence  is  much  smaller,  but 
the  required  input  powers  are  higher. 

IV.  Conclusion 

We  have  investigated  the  wavelength  dependence  of  a 
fiber-based  40-Gb/s  wavelength  converter  and  experimentally 
demonstrated  a  wavelength  conversion  span  of  14  nm  with  a 


Fig.  6.  Required  input  peak  power  to  reach  the  maximum  of  the  transfer 
function  using  2-ps  input  pulses  at  1535  nm  using  350-  and  35-m  highly 
nonlinear  dispersion-shifted  fiber. 

2-dB  variation  in  the  input  power  required  for  maximum  output. 
In  this  range,  the  output  pulsewidth  was  12  ps.  Simulations  can 
reproduce  the  experimental  results,  and  they  show  a  substantial 
improvement  if  a  highly  nonlinear  DSF  could  be  used  [10]. 
For  this  fiber,  the  power  and  pulse-width  changes  are  very 
small  over  a  wavelength  range  of  over  20  nm,  due  to  a  much 
lower  dispersive  walkoff.  This  type  of  wavelength  converter 
has  a  polarization  dependence  originating  from  the  underlying 
XPM  process.  However,  it  can  probably  be  made  polarization 
independent  by  use  of  polarization  scrambling  in  the  dispersion 
shifted  fiber  [8]  or  circularly  birefringent  fiber  [9].  Simulations 
with  2-ps  pulses  show  that  the  scheme  is  probably  scalable  to 
even  higher  bit  rates. 
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All-Optical  Header  Erasure  and  Penalty-Free 
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Abstract  All-optical  erasure  and  rewriting  of  2 £-Gb/s  nonre- 
tum-to-zero  (NRZ)  data  from  high-speed  retum-to-zero  (RZ)  data 
has  been  demonstrated  using  a  fiber-based  wavelength  converter. 
Such  a  wavelength  converter  will  block  the  low  frequencies  of  the 
NRZ  data  while  converting  the  RZ  pulses.  This  approach  uses  the 
frequency  discriminating  feature  of  the  wavelength  converter,  and 
does  not  require  active  control.  Penalty-free  rewriting  of  new  NRZ 
data  was  then  performed. 

Index  Terms — Cross-phase  modulation,  nonlinear  fiber  optics, 
optical  fiber  communication,  optical  networks,  optical  packet 
switching,  wavelength  converters. 

I.  Introduction 

TODAY  wavelength-division  multiplexing  (WDM)  is  well 
established  and  has  been  able  to  support  the  increasing 
demand  for  transmission  capacity.  While  the  performance  of 
present  systems  continues  to  improve,  optical  packet-switching 
technologies  may  be  required  to  deliver  low-latency  packet 
routing  and  forwarding  at  terabit  wire  rates.  The  technologies 
that  are  being  developed  should  also  support  header  era¬ 
sure/rewriting  in  order  to  enable  next-generation  IP  routing 
approaches  like  all-optical  label  swapping  (AOLS)  [l]-[3]  and 
new  simpler  IP  routing  protocols  such  as  multiprotocol  label 
switching  (MPLS)  [4]  to  simplify  route  lookup  and  optical 
implementation. 

In  all-optical  packet  switching,  the  header  can  be  encoded  at 
a  lower  bit  rate  and  inserted  in  front  of  the  payload  [1],  [5],  or 
encoded  on  a  subcarrier  at  the  same  wavelength,  but  outside  the 
data  frequency  band  [3],  At  each  node,  the  header  is  processed 
and  replaced  using  relatively  low-speed  electronics,  while  the 
high-speed  payload  remains  in  the  optical  domain  and  is  trans¬ 
mitted  through  the  node.  A  new  header  is  generated  electron¬ 
ically  and  multiplexed  with  the  payload  to  form  the  outgoing 
packet. 

In  an  optical  switching  node,  a  tunable  wavelength  converter 
combined  with  a  wavelength-selective  element,  can  be  used  as 
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Fig.  1.  Experimental  setup  of  the  XPM  wavelength  converter.  Er-FRL:  Fiber 
nng  laser;  TECL:  Tunable  External  Cavity  Laser,  MOD:  Mach-Zehnder 
modulator;  EDFA:  Erbium-doped  fiber  amplifier;  PC:  Polarization  controller; 
PMF:  Polarization  maintaining  fiber;  DSF:  Dispersion  shifted  fiber. 

a  switching  element.  Here  we  describe  how  a  wavelength  con¬ 
verter  based  on  cross-phase  modulation  in  an  optical  fiber  [6] 
can  be  used  to  passively  remove  a  low-bit-rate  nonretum-to-zero 
(NRZ)  header  from  a  high-speed  retum-to-zero  (RZ)  payload. 
In  the  wavelength  converter,  the  input  data  is  combined  with 
a  local  continuous-wave  (CW)  signal  and  launched  into  a  dis¬ 
persion-shifted  fiber.  The  input  data  will  impose  a  phase  mod¬ 
ulation  on  the  CW  light,  which  can  be  turned  into  an  amplitude 
modulation  by  filtering  out  one  of  the  generated  sidebands.  Such 
a  wavelength  converter  has  a  nonlinear  transfer  function  than 
can  be  used  to  suppress  a  header  to  some  extent,  but  it  also  has  a 
differentiating  nature  which  is  the  main  mechanism  employed  to 
remove  low-frequency  data  used  in  a  header.  A  new  header  can 
then  be  inserted  by  premodulating  the  local  CW  source  in  the 
wavelength  converter  [5],  If  the  CW  source  is  not  very  stable 
small  changes  in  wavelength  will  cause  power  fluctuations  in 
the  output.  Here  we  use  a  separate  source  for  the  new  data  to 
overcome  this  problem. 

II.  Experiment 

In  the  first  experiment,  the  wavelength  converter  was  charac¬ 
terized  with  a  sinusoidal  input.  Fig.  1  shows  the  experimental 
setup  where  8-ps  pulses  with  a  repetition  rate  of  10  GHz  were 
generated  from  an  actively  mode-locked  erbium-doped  fiber 
ring  laser,  and  modulated  with  10-Gb/s  PRBS  data.  (This 
pulsewidth  is  suitable  for  optically  multiplexing  the  10-Gb/s 
data  to  40  Gb/s.)  A  tunable  external  cavity  laser  was  externally 
modulated  with  a  sine  signal  ranging  from  1  to  20  GHz.  These 
two  sources  were  gated  with  acustooptic  modulators  and 
combined  to  form  data  and  sine  packets  as  shown  in  Fig.  2(a). 

In  the  wavelength  converter,  the  input  signal  is  combined  with 
a  local  continuous-wave  (CW)  souce,  and  amplified  before 
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Fig.  2.  Generated  packets  with  a  10  GHz  sine  and  10-Gb/s  R2  data  before 
(a)  and  after  (b)  the  wavelength  convener.  Packets  with  2.5-Gb/s  NRZ  data 
and  10-Gb/s  RZ  data  before  (c)  and  after  (d)  the  wavelength  converter.  It  can 
clearly  be  seen  how  the  RZ  data  is  converted,  while  the  lower  frequency  part  is 
suppressed. 

entering  a  5-km  dispersion-shifted  fiber  (DSF).  The  local  CW 
light  is  spectrally  broadened  by  cross-phase  modulation  from 
the  input  signal,  and  will  have  two  sidebands  at  the  output  of 
the  DSF.  A  loop-mirror  filter  which  has  a  sinusoidal  transfer 
function  (in  the  frequency  domain)  with  repetitive  notches  [6] 
is  used  to  remove  the  original  CW  light.  Then  a  bandpass  filter 
selects  one  of  the  remaining  two  sidebands  with  the  wavelength 
converted  data.  (This  process  can  be  seen  in  Fig.  4.)  Fig.  2(a) 
shows  data  packets  together  with  10-GHz  sine  packets  at  the 
input,  where  the  peak  amplitudes  were  adjusted  to  the  same 
value.  The  extinction  ratio  of  the  sine  signal  is  lower  because 
we  used  the  linear  regime  of  the  Mach-Zehnder  transfer 
function.  In  the  generation  of  the  optical  sidebands  on  the  CW 
signal,  the  input  pulse  power  causes  a  phase  modulation  on  the 
CW  light,  and  the  instantaneous  optical  frequency  deviation 
of  the  CW  light  is  given  by  the  derivative  of  the  optical  phase. 
Thus  the  instantaneous  frequency  deviation  will  depend  on 
the  derivative  of  the  input  pulse  power,  which  is  turned  into 
amplitude  modulation  after  filtering.  Due  to  this  differentiating 
nature,  the  rise  and  fall  times  determine  the  conversion  effi¬ 
ciency,  which  means  that  the  wavelength  converter  will  block 
low  signal  frequencies.  In  Fig.  2(b),  it  can  clearly  be  seen  how 
the  RZ  data  is  converted  and  the  10-GHz  sine  is  suppressed. 
Fig.  3  shows  the  output  peak  power,  normalized  to  the  output 
pulse  peak  power.  When  the  frequency  of  the  sine  is  decreased, 
the  output  power  drops  quickly  to  levels  below  —20  dB  at 
1.24  GHz.  Fig.  3  also  shows  the  suppression  when  the  input 
sine  signal  was  adjusted  to  half  the  RZ  data  amplitude,  where 
the  suppression  is  initially  higher,  but  drops  slower.  Due  to 
the  nonlinear  behavior  of  the  wavelength  converter,  the  output 
will  generally  not  be  a  sine,  but  the  peak  power  is  still  a  good 
measure  of  the  suppression.  For  pulses,  the  pulsewidth,  and 
not  the  bit  rate,  will  determine  the  conversion  efficiency.  It 
should  be  noted  that  NRZ  data  cannot  be  converted,  as  only  the 
edges  of  a  signal  are  converted.  In  a  sequence  of  consecutive 
ONE’S,  only  the  first  ONE  would  generate  an  output  signal. 
However,  there  would  still  be  some  output  that  could  cause 
crosstalk  when  a  new  header  is  inserted,  even  though  the  old 


Fig.  3.  Frequency  dependence  of  the  wavelength  converter  for  a  sine  signal 
when  the  input  sine  peak  power  was  adjusted  to  the  input  pulse  power  and  half 
the  input  pulse  power.  Output  powers  are  normalized  to  the  output  pulse  peak 
power,  and  half  the  output  pulse  peak  power,  respectively. 


Fig.  4.  Optical  spectra  for  the  10-Gb/s  RZ  and  the  2.5-Gb/s  NRZ,  at  the 
wavelength  converter  input,  after  the  dispersion-shifted  fiber,  and  after  the 
wavelength  converter. 

header  data  has  been  corrupted.  Fig.  2(c)  and  (d)  shows  the 
situation  for  packets  of  2.5-Gb/s  NRZ  and  10-Gb/s  RZ,  before 
(c)  and  after  (d)  the  wavelength  converter.  On  the  oscilloscope, 
the  remainder  of  the  2.5-Gb/s  NRZ  data  was  totally  hidden  in 
the  detector  noise.  By  averaging  the  signal,  the  suppression 
was  estimated  to  around  -23  dB,  which  agrees  with  the  sine 
measurements. 
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Average  received  optical  power  in  the  packets  {dBm] 

Fig.  5.  BER  measurements:  (♦)  the  original  2.5-Gb/s  NRZ  data  at  1535  nm; 
(+)  new  2.5-Gb/s  NRZ  data  at  1541  nm  with  EDFA  in  the  wavelength  converter 
turned  off;  (v)  new  2.5-Gb/s  NRZ  data  with  the  wavelength  converter  turned 
on;  and  (o)  new  2.5-Gb/s  NRZ  data  attenuated  5  dB,  otherwise  as  V- 

In  the  second  experiment,  the  10- Gb/s  RZ  data  and  the 
2.5-Gb/s  NRZ  data  were  switched  manually,  to  measure  the 
optical  spectra,  and  to  measure  the  penalty  for  rewriting  new 
2.5-Gb/s  data  where  the  original  data  was  erased.  At  the  input 
to  the  wavelength  converter,  the  average  power  of  the  NRZ  and 
the  RZ  data  were  equal.  Fig.  4  shows  the  optical  spectra  as 
measured  with  an  optical  spectrum  analyzer  at  the  input  to  the 
wavelength  converter,  after  the  DSF,  and  after  the  wavelength 
converter.  It  can  clearly  be  seen  how  the  RZ  data  significantly 
broadens  the  CW  source  in  the  WLC,  whereas  no  broadening 
can  be  noticed  in  the  diagrams  for  the  2.5-Gb/s  data. 

Then  the  bit-error  rate  (BER)  was  measured  for  the  orig¬ 
inal  and  rewritten  header  data  using  continuous  231  —  1 
pseudo-random  data.  Fig.  5  shows  the  back-to-back  measure¬ 
ments  for  the  original  transmitter  at  1535  nm  (•)  as  well  as 
for  the  new  transmitter  at  1541  nm  with  the  EDFA  in  the 
wavelength  converter  turned  off  (+).  The  small  difference  in 
sensitivity  of  0.2  dB  could  be  due  to  minor  differences  in  the 
transmitters  and  the  wavelength.  The  average  power  of  the  new 
data  was  adjusted  to  the  same  average  power  as  the  converted 
RZ  data.  The  bit-error  rate  when  the  wavelength  converter  is 
turned  on  (v)  shows  no  penalty  compared  to  the  case  with  the 
amplifier  turned  off.  The  new  data  is  offset  by  0.5  nm,  which 
means  that  the  crosstalk  will  not  be  interferometric.  With  a 
suppression  of  around  —  20  dB,  one  should  not  see  a  penalty 
and  the  experimental  result  is  expected.  When  the  power  of 
the  new  data  is  attenuated  by  5  dB  (o),  keeping  everything 


else  constant,  a  penalty  of  1  dB  can  be  seen,  which  increases 
to  5-dB  penalty  with  10-dB  attenuation  of  the  new  data.  This 
is  probably  due  to  noise  from  the  amplifier  in  the  wavelength 
converter.  In  this  experiment,  no  BER  measurements  were 
made  for  the  10-  and  40-Gb/s  data,  which  have  been  reported 
elsewhere  for  packets  [5]  and  continuous  data  [6]. 

DDL  Conclusions 

From  the  measurements  it  is  clear  that  this  type  of  wavelength 
converter  can  be  used  to  passively  erase  a  2.5-Gb/s  NRZ  header 
from  a  high-speed  RZ  payload.  Compared  to  other  approaches 
using  time-domain  header  [1],  [2],  no  timing  control  is  required 
to  erase  the  header.  To  insert  a  new  header  timing  control  would 
be  necessary  to  align  the  new  header  to  the  wavelength  con¬ 
verted  payload.  New  2.5-Gb/s  data  can  then  be  rewritten  in  dif¬ 
ferent  ways.  Here  a  separate  transmitter  was  used  for  the  new 
data,  which  showed  no  crosstalk  penalty  from  the  previously 
erased  data.  This  makes  the  system  more  stable  compared  to  the 
approach  of  premodulating  and  slightly  detuning  the  local  CW 
light  in  the  wavelength  converter  as  demonstrated  in  [5],  but  re¬ 
quires  a  high  extinction  ratio  of  the  new  header  source  when  the 
payload  is  present.  Otherwise,  the  local  header  source  would 
cause  the  payload  data  to  be  degraded  due  to  interferometric 
crosstalk.  A  third  option  which  has  not  been  experimentally  in¬ 
vestigated,  is  to  encode  the  new  header  by  frequency-modu¬ 
lating  the  local  CW  laser  in  the  wavelength  converter.  With  the 
present  tuning  speed  of  about  5  ns  this  would  limit  the  bit  rate  to 
a  maximum  of  100  Mb/s,  but  with  further  advances  in  tunable 
laser  technology  this  could  be  an  interesting  option. 
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Abstract — We  demonstrate  the  use  of  a  polarization-indepen¬ 
dent  dual-port  filter  to  extract  subcarrier  multiplexed  (SCM) 
data  from  a  baseband/SCM  channel  while  the  baseband  is  passed 
through  with  minimal  distortion.  This  architecture  solves  the  fiber 
dispersion-induced  fading  problem  for  SCM  channels  and  elim¬ 
inates  the  complexity  of  single-sideband  modulation  techniques 
by  suppressing  the  optical  carrier  at  the  tap  port  prior  to  pho¬ 
todetection.  Bit-error-rate  (BER)  measurements  were  performed 
on  the  baseband  channel  showing  that  the  SCM  removal  prior 
to  photodetection  reduces  crosstalk  with  the  subcarrier  channel. 
The  BER  on  the  SCM  data  shows  no  effect  of  dispersion-induced 
fading.  The  periodic  optical  frequency  characteristic  of  the  filter 
can  be  matched  to  the  ITU  grid,  making  this  tap  applicable  to 
multichannel  WDM  networks. 

Index  Terms — Fiber-loop  mirrors,  optical  filtering,  optical  label 
switching,  optical  networks,  optical  RF  filters,  subcarrier  multi¬ 
plexing,  wavelength  division  multiplexing,  WDM  networks. 


I.  Introduction 

The  UTILIZATION  of  subcarrier  multiplexed  (SCM)  data 
channels  in  optical  networks  has  many  potentially  impor¬ 
tant  roles  including  packet  addressing  [1],  label  removal  for 
all-optical  label  swapping  [2],  performance  monitoring  using 
subcarriers  [3],  and  network  management  and  control  [4].  These 
networks  require  an  efficient  method  to  monitor,  extract,  and 
potentially  erase  subcarrier  information  using  inexpensive  re¬ 
ceivers  and  electronics  [5].  Techniques  to  extract  SCM  data 
from  an  optical  baseband  channel  should  address  critical  issues 
such  as  power  penalties  due  to  fiber  dispersion-induced  subcar¬ 
rier  fading.  Optical  filtering  techniques  have  been  used  to  sim¬ 
plify  SCM  receiver  design  by  prefiltering  a  subcarrier  channel 
prior  to  photodetection  [6]  and  can  be  used  to  recover  a  single 
sideband  optical  subcarrier  channel  and  greatly  reduce  disper¬ 
sion-induced  performance  degradation  [7]. 

In  this  letter,  we  demonstrate  a  new  application  of  the 
fiber-loop  mirror  [8]  as  a  dual-output  filter  to  extract  optical 
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Fig.  1.  Dual-output  SCM  channel  extraction  filter  using  an  FLM.  Data  on 
suppressed  carrier  SCM  channel  are  recovered  using  a  photodetector. 

SCM  (OSCM)  baseband  channels  from  a  combined  base¬ 
band/SCM  data  channel.  The  optical  baseband  channel  passes 
through  the  filter  for  further  processing,  routing,  transmission, 
or  reinsertion  of  new  OSCM  data.  The  OSCM  extraction  port 
produces  a  suppressed  carrier  SCM  signal  that  is  demodulated 
using  a  direct  detection  photodetector.  The  resulting  receiver 
complexity  is  simpler  than  direct  and  coherent  RF  receivers 
combined  with  photodetectors  and  does  not  suffer  the  power 
penalty  due  to  dispersion-induced  SCM  fading.  Thus,  the 
complexity  of  single  sideband  subcarrier  transmitters  is  also 
avoided.  Since  the  filter  is  periodic  in  frequency,  it  can  be 
locked  to  the  ITU  grid,  allowing  multiple  SCM  channels  to 
be  extracted  simultaneously  from  a  WDM  data  stream  while 
preserving  the  baseband  data. 

II.  DESCRIPTION  OF  THE  TAP 

The  tap  is  realized  using  a  fiber-loop  mirror  (FLM)  [8]  with 
a  PM  fiber  as  shown  in  Fig.  1.  The  input  signal  consists  of  a 
combined  optical  baseband  channel  and  SCM  double-sideband 
(DSB)  modulated  channel.  The  FLM  transfer  function  is  set  so 
that  the  OSCM  extraction  port  notch  is  located  at  the  optical 
carrier  frequency.  As  shown  in  Fig.  1,  the  DSB  SCM  data  is 
output  to  the  drop  port  with  its  optical  carrier  suppressed  while 
the  baseband  channel  is  reflected  by  the  FLM  and  passed  to  the 
through  port  using  an  optical  circulator.  The  SCM  baseband  data 
is  recovered  directly  using  a  simple  photodetector.  The  suppres¬ 
sion  of  the  optical  carrier  at  the  tap  port  eliminates  the  fiber-in¬ 
duced  dispersion  fading. 

A.  FLM  Filter  Architecture 

The  filter  architecture  is  based  on  a  polarization-independent 
linear  FLM  [9]  that  employs  a  section  of  birefringent  fiber  with 
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Fig.  2.  The  optical  transfer  functions  for  the  two  outputs  illustrate  WDM 
operation  by  setting  the  mode  spacing  at  a  submultiple  of  the  ITU  100-GHz 
channel  spacing. 

a  length-birefringence  product  chosen  to  support  a  given  op¬ 
tical  channel  spacing  and  subcanier  frequency.  The  differen¬ 
tial  group  delay  (DGD)  due  to  the  birefringence  translates  into 
an  amplitude  variation  through  an  interference  condition  be¬ 
tween  counterpropagating  optical  signals  at  the  fiber  coupler. 
This  gives  rise  to  a  periodic  transmission  function  between  the 
input  and  the  two  output  ports,  as  shown  in  Fig.  2.  Due  to  the 
Sagnac  configuration,  the  loop  mirror  is  polarization-indepen¬ 
dent  both  for  amplitude  and  phase  response.  The  filter  can  po¬ 
tentially  give  a  very  high  rejection  ratio  in  the  notch  for  a  cou¬ 
pling  ratio  close  to  0.5  [10];  in  our  case,  we  measured  a  notch 
depth  of  30  dB. 

B.  Transmitter  Section 

The  baseband  data  and  SCM  signal  were  encoded  on  the  op¬ 
tical  carrier  by  means  of  a  dual-arm  Mach-Zehnder  LiNb03 
electrooptical  modulator  with  a  3-dB  bandwidth  of  18  GHz. 
The  generation  scheme  of  [11]  was  used:  on  one  arm  of  the 
modulator,  10-Gb/s  data  were  encoded,  while  the  other  arm  was 
fed  with  a  100-Mb/s  ASK  modulated  16.7-GHz  RF  tone.  This 
system  is  a  compact  way  to  multiplex  subcarrier  and  data  but  re¬ 
sults  in  a  tradeoff  of  the  extinction  ratio  for  the  baseband  trans¬ 
mission  and  introduces  several  distortion  mechanisms:  in  order 
to  transmit  the  subcarrier,  the  baseband  modulation  depth  must 
be  decreased  and  the  Mach— Zehnder  transfer  function  nonlin¬ 
earity  introduces  distortion  between  baseband  and  subcarrier 
data.  This  translates  to  a  power  penalty  for  baseband  bit-error 
rate  (BER)  with  respect  to  the  pure  baseband  optimal  transmis¬ 
sion.  These  effects  can  be  minimized  with  proper  adjustment 
of  the  modulator  bias,  data  amplitude,  and  subcarrier  power  but 
cannot  be  eliminated  once  the  signal  is  generated. 

C.  Receiver  Section 

The  use  of  optical  prefiltering  greatly  reduces  receiver  com¬ 
plexity  for  OSCM  signals.  The  elimination  of  the  optical  car¬ 
rier  with  baseband  data  allows  direct  detection  of  the  OSCM 
data  without  the  need  for  electrical  envelope  detection  of  the 
RF  signal.  Once  the  optical  earner  has  been  suppressed,  the 
OSCM  data  is  additive  in  power  and  therefore  the  dispersion- 
induced  phase  shift  between  the  two  sidebands  is  eliminated. 


Fig.  3.  Schematic  explanation  of  the  effect  of  a  narrow  notch  filter:  The  filter 
does  not  fully  suppress  the  low-frequency  tones  of  baseband  long  patterns, 
inducing  crosstalk  at  the  subcarrier  detection  port  (left).  The  problem  may  be 
overcome  by  using  a  flattened  notch  (right). 

For  the  baseband,  optical  prefiltering  suppresses  the  subcarrier 
prior  to  square  law  detection  at  the  receiver,  and  greatly  reduces 
crosstalk  between  the  baseband  and  the  subcarrier  data.  This 
degradation  is  additive  to  the  distortion  generated  at  the  trans¬ 
mitter,  as  described  above. 

ni.  Experimental  Results 

The  loop  was  constructed  with  birefringent  fiber  with  a  mea¬ 
sured  polarization  dispersion  of  1.825  ps/m.  A  WDM  system 
with  100-GHz  channel  spacing  with  16.7-GHz  SCM  frequency 
requires  a  filter  periodicity  of  33.4  GHz  which  corresponds  to  a 
DGD  of  29.9  ps.  The  necessary  length  of  PM  fiber  is  then  16.5 
m.  The  filter  3-dB  bandwidth  is  1 6.7  GHz,  suitable  for  transmit¬ 
ting  baseband  data  up  to  10  Gb/s.  Higher  bit  rates  are  limited  by 
the  bandwidth  of  the  filter,  not  by  the  polarization  dispersion. 
In  Fig.  2,  we  report  the  two  complementary  transfer  functions 
of  the  filter  we  realized  and  used  in  the  experiment.  The  pic¬ 
tures  are  taken  with  a  resolution  of  20  pm.  The  notch  depth 
shown  in  the  picture  is  limited  by  the  power  dynamic  of  the 
OSA.  The  difference  in  insertion  loss  is  due  to  the  fact  that  the 
through  port  signal  passes  twice  in  the  isolators  and  in  the  exper¬ 
imental  setup,  also  through  an  APC/FC  adapter.  The  actual  peri¬ 
odicity  is  slightly  different  from  the  nominal  100-GHz  spacing, 
but  it  can  prove  the  point  of  potential  compatibility  with  a  WDM 
ITU  grid.  The  transmission  performance  was  measured  using  a 
10-Gb/s  2'  —  1 PRBS  combined  with  a  231  - 1  PRBS  100-Mb/s 
ASK  modulated  16.7-GHz  subcanier  using  the  transmitter  ar¬ 
chitecture  described  in  [11].  The  reason  for  having  27  —  1  in 
the  baseband  is  due  to  crosstalk  between  the  modulated  base¬ 
band  and  the  subcanier.  The  pattern  length  is  critical  since,  for 
longer  pattern  lengths,  the  low-frequency  components  fall  into 
the  sub  earner  data  band.  In  our  case,  the  notch  is  so  narrow  that 
it  eliminates  the  carrier  but  cannot  as  effectively  erase  all  the 
low-frequency  components  of  the  baseband.  In  the  case  of  short 
word  lengths,  the  leftover  tones  are  more  widely  spaced  and  the 
disturbances  can  be  effectively  reduced  with  electrical  filtering 
after  the  detector.  The  problem  could  be  avoided  by  using  a  flat¬ 
tened  notch  [12]  in  order  to  have  enough  rejection  on  a  wider 
bandwidth,  as  shown  in  Fig.  3. 

A.  Baseband  Data 

In  Fig.  4,  a  back-to-back  BER  measurement  for  the  10-Gb/s 
channel  without  the  SCM  channel  (denoted  by  x )  is  compared 
to  the  direct  detection  of  the  10  Gb/s  channel  in  the  presence  of 
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Fig.  4.  Back-to-back  BER  for  10-Gb/s  baseband,  x:  baseline,  optimal 
conditions  for  baseband  transmission  without  SCM  channel.  *:  baseband 
multiplexed  with  subcarrier  channel  before  the  LMF.  o:  BER  after  prefiltering 
with  LMF. 


Fig.  5.  Calculated  received  RF  power  for  a  DSB  1 6.7  GHz  signal  with  D  —  17 
ps/nm-km  and  a  =  0.2  dB/km  (Solid  Curve)  with  data  point  corresponding 
to  the  experimental  conditions.  Eye  diagrams,  measured  using  the  LMF  with 
various  fiber  lengths  show  elimination  of  fading,  in  particular  the  insets  show 
that  the  eyes  diagrams  back  to  back  and  after  10  km  (the  null  in  the  fading  curve) 
are  identical. 


the  SCM  channel  before  and  after  the  fiber  loop  mirror  filter 
(LMF)  (denoted  by  *  and  o,  respectively).  The  SCM  multi¬ 
plexed  signal  (*)  shows  a  degradation  both  in  the  BER  slope 
and  receiver  sensitivity  that  can  be  explained  by  the  way  the 
mixed  subcarrier/baseband  signal  is  generated  as  previously  ex¬ 
plained  in  Section  II-B  and  to  the  intermodulation  crosstalk  at 
the  receiver  (Section  3I-B).  This  last  effect  is  significantly  re¬ 
duced  by  optical  prefiltering,  as  it  results  from  the  improvement 
of  the  BER  slope  measured  after  the  FLM  (o).  The  remaining 
2-dB  power  penalty,  apart  from  the  already  mentioned  distor¬ 
tion  at  the  transmitter,  is  due  to  distortion  by  the  LMF  and  other 
LMF  instabilities  as  discussed  in  Section  IQ-B. 

B.  Subcarrier  Data 

The  receiver  sensitivity  (for  a  BER  of  10“ ' )  of  the  suppressed 
carrier  SCM  detection  was  measured  constant  over  more  than 
60  km  of  standard  fiber,  as  reported  in  Fig.  5.  We  limited  the 
measurement  of  the  BER  to  10“7  also  because  the  filter  was 
sensitive  to  mechanical  and  temperature  disturbances;  however, 
we  are  currently  implementing  active  stabilization  of  the  filter 
for  improved  performance.  In  the  lower  picture  of  Fig.  5,  the 
fading  curve  calculated  for  our  experimental  conditions  can  be 
seen.  A  null  is  present  at  12  km,  but  the  inset  in  the  upper  picture 
clearly  shows  that,  even  in  that  case,  the  eye  diagram  is  clearly 
open. 

IV.  Conclusion 

We  have  proposed  and  demonstrated  a  subcarrier  tap  based 
on  a  dual-port  polarization-independent  LMF.  This  approach 
enables  extraction  of  a  double-sideband  optical  subcarrier  data 
channel  from  a  baseband  channel  with  minimal  distortion  to  the 
baseband.  Suppression  of  the  optical  carrier  at  the  drop  port  al¬ 
lows  for  double-sideband  subcarrier  transmission  without  dis¬ 
persion-induced  fading  and  allows  data  recovery  using  a  simple 
photodetector.  Distortion  of  the  recovered  baseband  is  reduced 
due  to  a  decrease  in  the  intermodulation  distortion  components 
at  the  photodetctor.  In  the  future,  the  periodic  nature  of  the  LMF 


transfer  function  may  be  utilized  for  simultaneous  extraction  of 
multiple  SCM  channels  from  a  WDM  stream  at  the  input.  Due 
to  the  ultrawide  bandwidth  of  the  transfer  function,  this  filter 
can  also  be  locked  to  the  ITU  grid. 
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Abstract — All-optical  label  swapping  is  a  promising  approach 
to  ultra-high  packet-rate  routing  and  forwarding  directly  in  the 
optical  layer.  In  this  paper,  we  review  results  of  the  DARPA  Next 
Generation  Internet  program  in  all-optical  label  swapping  at  Uni¬ 
versity  of  California  at  Santa  Barbara  (UCSB).  We  describe  the 
overall  network  approach  to  encapsulate  packets  with  optical  la¬ 
bels  and  process  forwarding  and  routing  functions  independent 
of  packet  bit  rate  and  format  Various  approaches  to  label  coding 
using  serial  and  subcarrier  multiplexing  addressing  and  the  asso¬ 
ciated  techniques  for  label  erasure  and  rewriting,  packet  regener¬ 
ation  and  packet-rate  wavelength  conversion  are  reviewed.  These 
functions  have  been  implemented  using  both  fiber  and  semicon¬ 
ductor-based  technologies  and  the  ongoing  effort  at  UCSB  to  inte¬ 
grate  these  functions  is  reported.  We  described  experimental  re¬ 
sults  for  various  components  and  label  swapping  functions  and 
demonstration  of  40  Gb/s  optical  label  swapping.  The  advantages 
and  disadvantages  of  using  the  various  coding  techniques  and  im¬ 
plementation  technologies  are  discussed. 

I.  Introduction 

INTERNET  traffic  is  growing  exponentially  and  is  forcing 
next  generation  Internet  Protocol  (IP)  networks  to  scale 
far  beyond  present  speed,  capacity,  performance,  and  packet 
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forwarding  rates.  In. fiber-based  optical  networks,  next  gener¬ 
ation  optical  IP  routing  will  require  technologies  to  support 
packet  routing  and  forwarding  operations  at  Terabit  wire  rates 
that  are  compatible  with  wavelength  division  multiplexed 
(WDM)  transmission  and  routing.  Up  to  50%  of  IP  traffic 
consists  of  packets  smaller  than  522  bytes  and  50%  of  these 
packets  are  in  the  40-44  byte  range.  New  low-latency  packet 
forwarding  and  routing  technologies  will  be  required  that  can 
handle  wire-rate  routing  of  the  smallest  packets  at  rates  in 
excess  of  Gigapackets/s.  These  technologies  should  support 
new  streamlined  IP  routing  protocols  such  as  Multiprotocol 
Label  Swapping  (MPLS)  [1]  that  simplify  route  lookup  and 
separate  the  routing  and  forwarding  functions. 

All-Optical  Label  Swapping  (AOLS)  [2]-[4]  implements  the 
packet-by-packet  routing  and  forwarding  functions  of  MPLS 
directly  in  the  optical  layer.  This  is  in  contrast  to  next  gener¬ 
ation  commercial  optical  networks  that  will  use  MPLS  to  set 
up  optical  circuits  over  which  flows  of  electronic  packets  are 
routed  [5].  For  future  optical  networks,  it  will  be  desirable  to 
perform  optical  routing  layer  functions  independent  of  the  IP 
packet  length  and  payload  bit  rate.  AOLS  approaches  should  in¬ 
terface  seamlessly  to  both  WDM  and  TDM  systems.  This  leads 
to  the  concept  of  using  optical  labels  to  “encapsulate”  the  IP 
packet  in  the  optical  layer.  The  optical  label  format  is  chosen  to 
best  match  the  optical  routing  and  forwarding  technology  and 
to  satisfy  other  constraints  in  the  optical  layer  (e.g.,  the  current 
lack  of  flexible  optical  buffering  technologies). 

In  this  paper,  we  describe  AOLS  research  at  the  University 
of  California  at  Santa  Barbara  (UCSB)  supported  under  the 
DARPA  sponsored  Next  Generation  Internet  (NGI)  program 
and  the  DARPA  sponsored  Center  for  Multidisciplinary  Optical 
Switching  Technology  (MOST)  and  the  NGI  partner  Cisco 
Systems  and  NGI  industrial  affiliates  New  Focus,  Agilent 
Technologies,  and  Netcom  Systems  and  performed  in  collabo¬ 
ration  with  the  DARPA  supported  NGI  program  at  Telecordia 
Technologies. 

The  paper  is  organized  into  seven  sections.  Section  II  de¬ 
scribes  the  overall  concept  of  AOLS,  its  application  in  optical 
networks  and  optical  Internet  protocol  routers.  The  key  building 
blocks  used  to  implement  AOLS  are  also  described.  In  Sec¬ 
tion  ID  we  review  two  different  label-coding  techniques  that 
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Fig.  1.  Optical  label  swapping  with  wavelength  conversion  in  an  optical  core  network  using  edge  and  core  WDM  IP  routers. 


Fig.  2.  Layered  routing  and  forwarding  hierarchy  and  associated  network  element  connection  diagram  for  an  AOLS  network. 


have  been  used  to  date.  Section  IV  introduces  issues  specific 
to  building  multihop  AOLS  networks.  Several  technologies  that 
have  been  used  to  realize  AOLS  building  blocks  are  covered  in 
Section  V  and  the  AOLS  subsystems  are  covered  in  Section  VI. 
Systems  experimentally  demonstrated  at  UCSB  are  described  in 
Section  VII. 

n.  All-Optical  Label  Swapping  (AOLS) 

An  AOLS  optical  packet  core  network  is  illustrated  in  Fig.  1. 
IP  packets  enter  the  core  network  at  an  ingress  router  and 
travel  multiple  hops  through  the  core,  exiting  at  an  egress 
router.  Packets  are  handled  within  the  network  by  core  AOLS 
routers  or  AOLS  subnets,  as  described  in  [2].  Fig.  2  depicts 
the  physical  network  elements  connected  by  fiber  links  and 
the  packet  routing  and  forwarding  hierarchy.  IP  packets  are 
generated  at  the  electronic  routing  layer  and  processed  in 
an  adaptation  layer  that  “encapsulates”  IP  packets  with  an 
optical  label  without  modifying  the  original  packet  structure. 
The  adaptation  layer  also  shifts  the  packet  and  label  to  a  new 


wavelength  specified  by  local  routing  tables.  An  optical  multi¬ 
plexing  layer  multiplexes  labeled  packets  onto  a  shared  fiber 
medium.  Several  optical  multiplexing  approaches  may  be  used 
including  insertion  directly  onto  an  available  WDM  channel, 
packet  compression  through  optical  time  division  multiplexing 
or  time  interleaving  through  optical  time  division  multiplexing 
[6].  This  technique  is  not  limited  to  IP  packets  and  other  packet 
or  cell  structures  like  ATM  may  also  be  routed. 

Once  inside  the  core  network,  core  routers  or  AOLS  sub¬ 
nets  [2]  perform  routing  and  forwarding  functions.  The  routing 
function  computes  a  new  label  and  wavelength  from  an  internal 
routing  table  given  the  current  label,  current  wavelength,  and 
fiber  port.  The  routing  tables  (at  egress  and  core  routers)  are 
generated  by  converting  IP  addresses  into  smaller  pairs  of  labels 
and  wavelengths  and  distributing  them  across  the  network  much 
in  the  same  way  that  multiprotocol  label  switching  (MPLS)  is 
used  in  today’s  IP  networks  [1].  The  forwarding  function  in¬ 
volves  swapping  the  original  label  with  the  new  label  and  physi¬ 
cally  converting  the  labeled  packet  to  the  new  wavelength.  Other 
switching  or  buffering  mechanisms  (space,  time,  etc.)  are  also 


2060 


JOURNAL  OF  UGHTWAVE  TECHNOLOGY.  VOL.  18,  NO.  12.  DECEMBER  2000 


IP  Bit  Rate  e.g.,  2.5  Gbps  -40  Gbps 


(b) 

Fig.  3.  Optical  label  coding  techniques:  (a)  fixed  rate  serial  label  and  (b)  optical  subcarrier  multiplexed  label. 


configured  in  the  forwarding  process.  The  reverse  process  of  op¬ 
tical  demultiplexing,  adaptation  and  electronic  routing  are  per¬ 
formed  at  the  egress  node. 

III.  Optical  Label  Coding  Techniques 

The  method  of  coding  the  label  onto  a  packet  impacts  the 
channel  bandwidth  efficiency,  the  transmission  quality  of  the 
packet  and  label,  and  the  best  method  to  wavelength  convert  the 
packet  and  optically  swap  the  label.  Two  approaches  to  optical 
label  coding  are  the  serial  label  [7],  [8]  and  the  optical  subcar¬ 
rier  multiplexed  label  [3],  [9]— [1 1],  as  illustrated  in  Fig.  3.  With 
serial  coding  a  fixed  bit  rate  label  is  multiplexed  at  the  head  of 
the  IP  packet  with  the  two  separated  by  an  optical  guard-band 
(OGB)  as  shown  in  Fig.  3(a).  The  OGB  is  use  to  facilitate  label 
removal  and  reinsertion  without  static  packet  buffering  and  to 
accommodate  finite  switching  times  of  optical  switching  and 
wavelength  conversion.  The  bit-serial  label  is  encoded  on  the 
same  optical  wavelength  as  the  IP  packet  and  is  encoded  as 
a  baseband  signal.  For  optical  subcarrier  multiplexed  labels 
a  baseband  label  is  modulated  onto  a  RF  subcarrier  and  then 
multiplexed  with  the  IP  packet  on  the  same  wavelength  [see 
Fig.  3(b)].  This  multiplexing  may  be  performed  electronically 
or  optically  as  described  in  [2]-[4],  [12]-[14].  An  OGB  is  not 
necessary  in  the  subcarrier  case  since  the  label  is  transmitted 
in  parallel  with  the  packet.  It  is  only  necessary  that  the  label 
fit  within  the  boundaries  of  the  packet,  however  an  OGB  may 
be  used  if  accumulated  misalignment  of  the  label  and  payload 
occurs  during  multiple  hops. 

Packet  transparency  is  realized  by  setting  a  fixed  label  bit  rate 
and  modulation  format  independent  of  the  packet  bit  rate.  The 
choice  of  label  bit  rate  is  driven  by  a  combination  of  factors 
including  the  speed  of  the  burst-mode  label  recovery  electronics 
and  the  duration  of  the  label  relative  to  the  shortest  packets  at  the 
fastest  packet  bit  rates.  Additionally,  running  the  label  at  a  lower 
bit  rate  allows  the  use  of  lower  cost  electronics  to  process  the 


label.  The  label  and  packet  bits  can  be  encoded  using  different 
data  formats  to  facilitate  data  and  clock  recovery.  For  example,  if 
the  IP  packet  is  compressed  to  40  Gb/s  using  an  RZ  data  format, 
the  label  can  be  encoded  at  2.5  Gb/s  using  NRZ  format.  A  20-bit 
label  transmitted  at  2.5  Gb/s  occupies  the  same  duration  as  a 
40-byte  packet  transmitted  at  40  Gb/s. 

Encapsulation  of  IP  packets  using  optical  labels  has  advan¬ 
tages  in  that  the  contents  of  the  original  IP  packet  are  not  mod¬ 
ified  and  the  label  is  coded  at  the  same  wavelength  as  the  IP 
packet.  In  the  serial  case,  erasure  and  rewriting  of  the  label  may 
be  performed  independently  of  the  IP  packet  bit  rate,  however, 
timing  of  the  label  replacement  and  possibly  erasure  process  is 
somewhat  time  critical.  The  subcanier  labels  have  the  advantage 
that  they  can  be  removed  and  replaced  more  asynchronously 
with  respect  to  the  packet  but  potentially  suffer  from  dispersion 
induced  fading,  a  problem  that  can  be  remedied  as  described  in 
a  later  section. 

IV.  AOLS  Functions  and  Building  Blocks 

Label  recovery,  label  swapping  and  packet  forwarding  are  the 
basic  functions  handled  by  the  AOLS  building  blocks,  shown 
in  Fig.  4.  In  a  core  router  or  AOLS  subnet  [see  Fig.  4(a)],  a 
burst-mode  label  recovery  module  is  used  to  recover  label  clock 
and  data  for  processing  in  electronic  routing  circuitry  without 
significantly  perturbing  the  through-going  optical  packet  data. 
The  routing  circuit  maps  the  incoming  label  and  wavelength  to 
a  new  label  and  wavelength  based  on  internal  routing  tables. 
The  label  erasure  process  may  be  built  into  this  stage  depending 
on  the  implementation  technology.  The  second  stage  performs 
label  erasure  (if  not  performed  in  the  first  stage),  wavelength 
conversion  and  label  rewriting  using  a  wavelength  converter  or 
combination  of  converter  and  optical  filter.  A  fiber  optic  delay 
follows  the  label  recovery  stage  to  match  the  routing  processing 
delay  with  the  arrival  of  the  packet  at  the  second  optical  stage. 
An  edge  router  [see  Fig.  4(a)]  utilizes  a  slightly  modified  first 
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Fig.  4.  Building  blocks  for  AOLS  (a)  core  router  and  (b)  core  optical  routers  or  subnets. 


stage  to  detect  the  IP  packet  header  with  minimal  disturbance  to 
the  packet.  The  second  stage  performs  the  same  wavelength  con¬ 
version  and  label  rewriting  processes  as  in  the  core  router,  how¬ 
ever,  the  wavelength  conversion  may  be  optoelectronic-optic 
(OEO)  depending  on  the  network  architecture. 

v.  Issues  in  Multihop  AOLS  Networks 

Routing  in  multihop  optical  packet  network  brings  to  the  table 
a  complex  set  of  issues  that  must  take  into  account  the  integrity 
of  data  transmission  and  the  bursty  nature  of  the  data.  We  briefly 
discuss  the  three  issues  of  routing  in  a  burst-mode  environment, 
cascadability,  and  buffering.  Each  issue  is  complex  enough  to 
dedicate  a  paper  on  the  topic  and  many  papers  on  packet  con¬ 
gestion  and  buffering  have  been  written  to  date  [11],  [15], 

A.  Burst  Mode  Routing 

The  main  issues  in  burst-mode  routing  are  summarized  in 
Fig.  5.  Packets  arrive  at  a  routing  node  with  unknown  amplitude 
and  phase  relative  to  a  local  time  reference.  The  inter-packet 
spacing  is  also  unknown.  For  subcarrier  coded  labels,  the  data 
envelope  must  be  recovered  from  a  detected  subcanier  with  un¬ 
known  RF  phase.  For  any  coding  technique,  data  and  clock  re¬ 


channel  must  also  be  able  to  accommodate  burst-mode  opera- 
tion.  The  process  of  label  erasure,  wavelength  conversion,  label 
rewriting,  and  potential  packet  regeneration  must  operate  under 
similar  conditions  as  the  label  receiver,  particularly  with  respect 
to  amplitude  dynamic  range. 

B.  Congestion  and  Buffering 

Optical  packet  collisions  can  occur  at  any  core  router  node 
or  at  the  edge  egress  points.  Additionally,  congestion  within 
the  packet  handling  hardware  can  occur  if  the  label  routing  and 
packet  forwarding  functions  are  not  pipelined  properly.  The  cu¬ 
mulative  interface  delay,  which  is  defined  as  the  time  involved  to 
route  a  packet  from  the  input  port  to  the  output  port  of  a  router, 
is  on  the  order  of  tens  of  micrometers  in  today’s  state-of-the-art 
routers.  However  with  an  optimally  designed  label-processing 
and  routing  unit,  the  time  required  to  process  a  header  should 
be  less  than  100  ns.  In  a  bursty  network,  inter-packet  time  inter¬ 
vals  are  not  constant,  therefore,  the  incoming  packets  may  have 
to  be  buffered  while  a  packet  is  being  routed.  The  improvement 
m  the  packet  processing  time  reduces  the  buffering  time  and 
thereby  decreases  latency.  The  routing  function  should  incor- 


covery  must  be  performed  without  a  priori  knowledge  of  the 
optimal  threshold  or  clock  phase.  To  efficiently  use  the  label 
channel,  the  header  should  be  extracted  within  the  first  few  bits. 
These  qualities  are  addressed  in  burst-mode  receiver  technology 
described  in  several  publications  [16],  [17].  The  optical  packet 


porate  packet  forwarding,  label  erasure  and  rewriting  the  new 
label.  The  new  label  must  be  replaced  with  minimum  knowl¬ 
edge  of  the  payload  contents.  Wavelength  conversion  can  facili¬ 
tate  all  the  above  functions.  The  subject  of  packet  buffering  and 
contention  is  not  a  focus  of  this  paper,  and  there  have  been  mul¬ 
tiple  publications  on  this  subject  [11],  [15]. 
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Fig.  5.  At  each  router,  label  recovery/swapping  and  packet  routing  and  forwarding  must  be  performed  on  data  that  arrives  on  an  unknown  wavelength  with 
unknown  phase  and  amplitude. 
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Fig.  6.  General  model  of  an  AOLS  wavelength  converter  subsystem. 


VI.  AOLS  Technologies 

In  thus  section,  we  review  various  technologies  that  have  been 
used  to  build  AOLS  subsystems  and  systems  demonstrators. 
These  technologies  are  classified  into  AOLS  wavelength  con¬ 
verters,  fiber  loop  mirrors,  wavelength  agile  sources,  optical  de¬ 
multiplexers  and  packet  transmitters. 

A.  AOLS  Packet  Wavelength  Converters 

In  general,  an  AOLS  packet  wavelength  converter  per¬ 
forms  the  functions  of  label  optical  label  erasure,  packet  rate 
wavelength  conversion,  packet  regeneration  and  optical  label 
rewriting.  A  general  model  of  an  AOLS  wavelength  converter 
subsystem  including  the  label  tap  is  shown  in  Fig.  6.  Shown  at 
the  inputs  of  Fig.  6  are  the  two  possible  types  of  labeled  packet 
inputs  (serial  or  OSCM).  Depending  on  the  combination  of 
label  coding  technique  and  wavelength  converter  technology, 
the  label  tap  may  or  may  not  handle  the  label  erasure  function. 
A  tap  that  handles  the  function  of  erasure  and  label  recovery  is 
described  in  the  next  section  on  fiber  loop  mirrors. 


AOLS  wavelength  converters  have  the  general  capability  to 
simultaneously  erase  and  rewrite  during  the  conversion  process. 
If  the  label  is  placed  in  a  properly  designed  region  of  the  op¬ 
tical  or  electrical  bandwidth  or  intensity,  the  packet  data  at  (A*) 
is  transferred  over  to  the  new  wavelength  (A j)  and  the  label 
on  the  original  wavelength  is  not  transferred  to  (A j).  At  the 
same  time,  the  new  label  may  be  pre-modulated  onto  the  new 
wavelength  source  (Aj)  and  will  appear  with  the  wavelength 
converted  packet  at  (A j)  and  not  be  transferred  to  the  original 
wavelength  (A*).  Certain  wavelength  converter  geometries  also 
allow  the  new  header  to  be  written  into  the  wavelength  converter 
module  directly. 

The  packet  wavelength  converter  is  also  used  to  perform 
signal  regeneration  of  the  packet  to  support  multihop  operation. 
This  regeneration  is  at  least  2R  where  the  extinction  ratio 
is  enhanced  and  the  “0”  and  “1”  level  noise  reduced.  3R 
regeneration  with  retiming  may  also  be  performed  [18]. 

1)  Semiconductor  Optical  Amplifier  (SOA)  Based  Con¬ 
verters:  Cross-gain  modulation  (XGM)  in  SOAs  is  a  well 
understood  effect  that  allows  an  intensity-modulated  signal  to 
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Fig.  7.  SOA-based  MZI-WC  layout  of  UCSB  device. 


2)  Fiber  XPM  Wavelength  Converter:  To  allow  operation  at 
ultrahigh  bit  rates  we  have  demonstrated  a  new  wavelength  con¬ 
verter  based  on  nonlinearities  in  an  optical  fiber  [23]  that  sup¬ 
ports  operation  at  40  Gb/s  and  beyond  (possibly  up  to  several 
hundered  gigabits  per  second).  In  this  scheme,  we  utilize  XPM 
in  a  dispersion  shifted  fiber  imposed  from  the  incoming  data 
onto  CW  light,  followed  by  conversion  of  the  phase  modula¬ 
tion  to  amplitude  modulation  using  optical  filtering,  as  shown  in 
Fig.  8.  The  incoming  data  is  combined  with  a  CW  light  signal 
and  sent  through  an  optical  fiber  where  the  data  imposes  a  phase 
modulation  onto  the  CW  light  via  fiber  XPM.  This  phase  mod¬ 
ulation  generates  optical  sidebands  on  the  CW  signal,  which 
are  converted  to  amplitude  modulation  by  suppressing  the  orig¬ 
inal  CW  carrier  using  an  optical  notch  filter.  A  conventional 
band-pass  filter  to  select  one  of  the  two  sidebands  then  follows 


be  transferred  from  wavelength  to  another  wavelength  [19], 
[20],  Operation  at  10  Gb/s  or  more  is  possible  using  this 
technique  and  polarization  insensitive  conversion  is  possible 
using  amplifiers  with  polarization  insensitive  gain  [21].  XGM 
technique  decreases  the  extinction  ratio  of  the  output  signal 
(8  dB)  and  is  asymmetric  for  frequency-up-conversion  and 
frequency-down-conversion.  XGM  can  also  introduce  chirping 
and  amplitude  distortion  unless  designed  properly.  This  con¬ 
version  technique  is  bit  rate  transparent  out  to  a  maximum 
designed  frequency  and  works  for  amplitude  modulation 
schemes  only.  The  XGM-WC  is  also  an  inverting  converter 
and  must  be  used  in  conjunction  with  a  secondary  device  to 
provide  noninverting  operation  that  is  important  for  multihop 
applications. 

The  cross  phase  modulation  effect  is  utilized  in  an  interfer¬ 
ometer  configuration  and  can  be  realized  using  SOA-based 
waveguide  structures.  Fig.  7  shows  a  Mach-Zehnder  interfer¬ 
ometer  wavelength  converter  (MZI-WC)  under  fabrication  at 
UCSB  under  the  NGI  program.  The  input  signal  passes  through 
one  of  the  interferometer  arms  and  optically  modulates  the 
phase  of  that  arm.  The  interferometric  effect  converts  this  phase 
modulation  to  an  amplitude  modulation  as  the  probe  signal 
traverses  both  arms.  XPM  produces  a  high  extinction  ratio 
for  the  converted  signal  (better  than  15  dB)  due  to  its  highly 
nonlinear  transfer  characteristic.  The  device  may  be  operated 
in  an  inverting  or  noninverting  mode,  however,  the  threshold 
characteristic  of  the  XPM-WC  restricts  the  dynamic  range  of 
the  input  signal  such  that  the  average  signal  level  rests  at  the 
proper  transfer  characteristic  operating  point.  Due  to  its  high 
quality  output  signal,  this  technique  represents  an  attractive 
solution  for  AOLS  systems.  In  a  later  section  on  AOLS  im¬ 
plementations  we  describe  how  to  combine  XGM  and  XPM 
converters  to  gain  the  full  functionality  of  and  AOLS-WC. 
The  signal  integrity  can  be  compromised  by  accumulated 
distortion  of  the  payload  and  label  due  to  fiber  dispersion  and 
nonlinearities  and  accumulated  SNR  degradation  due  optical 
amplifier  noise  and  system  nonlinearities  [3],  [4].  Dispersion 
induced  bit  broadening,  and,  in  particular,  PMD  [22],  is  a 
very  important  factor  in  high-speed  (40  Gb/s  and  higher  bit 
rate)  systems.  In  general,  the  SNR  and  bit  shape  of  packets 
with  bit  rate  of  10  Gb/s  and  greater  will  have  to  be  managed 
or  regenerated  in  multihop  systems.  Using  2R  regeneration 
during  the  wavelength  conversion  process  can  address  the 
dispersion-induced  bit  broadening  and  SNR  degradation  issues. 


the  notch  filter.  However,  in  principle  it  is  sufficient  to  use  a 
single  band  pass  filter,  e.g.,  a  fiber  bragg-grating  or  fiber-loop 
mirror,  provided  that  it  can  suppress  the  original  CW  carrier 
enough. 

Fig.  9  shows  the  eye-pattern  (a)  before  and  (b)  after  wave¬ 
length  conversion  of  40  Gb/s  data.  The  polarization  dependence 
for  XPM  in  a  fiber  is  5  dB  and  the  polarization  dependence 
as  measured  with  a  power  meter  for  the  wavelength  converter 
could  vary  between  2  and  8  dB,  depending  on  the  position  and 
bandwidth  of  the  band-pass  filter.  However,  the  polarization  de¬ 
pendence  can  be  eliminated  by  utilizing  polarization  scrambling 
[24]  or  circularly  polarized  fiber  [25],  Even  though  a  long  fiber 
was  used,  no  stability  problems  were  observed  and  the  system 
was  only  mechanically  polarization  sensitive  before  combining 
the  data  and  CW  light.  Since  the  filter  arrangement  was  polariza¬ 
tion  insensitive  environmental  disturbances  to  the  DSF  did  not 
affect  the  performance  apart  from  a  small  timing  shift,  which 
can  be  compensated  for  by  a  clock  recovery  circuit. 

Since  this  scheme  relays  on  optical  filtering  the  issue  of  wave¬ 
length  tunability  of  the  filters  is  critical.  The  wavelength  con¬ 
verter  must  have  the  ability  to  be  reconfigured  to  convert  to  an 
arbitrary  wavelength  within  a  few  nanoseconds.  Tunable  lasers 
are  today  available  that  can  be  tuned  to  an  arbitrary  wavelength 
within  less  than  a  nanosecond  [26]  but  tunable  filters  are  usu¬ 
ally  not  very  fast.  In  a  practical  implementation,  an  important 
feature  of  a  wavelength  converter  is  to  allow  no  conversion  and 
conversion  to  wavelength  channels  near  the  input  wavelength. 
Using  this  type  of  wavelength  converter  scheme,  this  can  be 
achieved  by  simply  switching  the  CW  laser  off  and  move  the 
output  filter  to  the  actual  wavelength  and  filter  out  a  part  of  the 
due  to  self-phase  modulation  broadened  spectrum  [27].  Another 
important  issue  is  cascadability  of  multiple  wavelength  con¬ 
verters.  Due  to  the  inherent  nonlinear  transmission  and  virtually 
instantaneous  response  of  the  wavelength  converter  it  should  be 
well  suited  for  multi-hop  operation.  Simulations  show  that  more 
than  100  hops  can  be  passed  without  any  significant  degradation 
in  SNR  or  timing  jitter.  This  is,  however,  subject  to  further  ex¬ 
perimental  investigations. 

B.  Suppressed  Carrier  SCM  Taps  Using  Fiber  Loop 
Mirrors  (FLMs) 

If  double  sideband  SCM  modulation  is  used  to  encode 
labels,  periodic  signal  fading  occurs  due  to  fiber  dispersion. 
The  amount  of  fading  will  vary  depending  on  the  inter-node 
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Fig.  8.  Principle  of  operation  of  high-speed  XPM  fiber-based  wavelength  converter. 


(b) 

Fig.  9.  (a)  Nonlinear  transmission  characteristic  of  fiber  XPM-WC  and  (b)  40 

Gb/s  original  and  converted  eye  diagrams. 

distance  and  will  help  determine  the  required  dynamic  range  of 
the  burst-mode  subcarrier  receiver.  Therefore,  either  dispersion 
compensation  or  dispersion  insensitive  detection  techniques 
must  be  employed. 

When  the  subcarrier  signal  is  detected  at  a  square  law  pho¬ 
toreceiver  the  subcarrier  amplitude  depends  upon  the  phase  dif¬ 
ference  between  the  two  sidebands.  When  this  signal  is  propa¬ 
gated  through  a  fiber,  the  dispersion  induces  a  relative  time  delay 
between  the  subcarrier  sidebands.  Thus  the  received  power  Ps 
has  a  dependence  on  the  distance  traversed  in  the  fiber.  For  cer¬ 
tain  values  of  fiber  length  L  (or  u;s  for  a  fixed  fiber  length)  the 
received  power  Ps  is  null  and  the  signal  is  cancelled  out.  In  a 
multihop  system  where  the  length  of  fiber  through  which  the 
light  travels  is  unknown,  this  effect  will  cause  label  loss. 

By  suppressing  the  subcarrier,  the  sidebands  are  detected  as 
individual  wavelengths  carrying  the  same  information  and  the 
beat  products  between  each  sideband  and  the  optical  carrier  is 
no  longer  detected.  Therefore,  only  the  information  modulated 


on  the  sidebands  is  recovered  and  dispersion  induced  fading  is 
eliminated.  We  have  successfully  employed  a  fiber  loop  mirror 
(FLM)  to  solve  the  subcarrier  fading  problem  while  at  the  same 
time  providing  a  means  to  simultaneously  tap  and  erase  SCM  la¬ 
bels  [14].  The  fiber  loop  mirror,  shown  in  Fig.  10(a),  drops  the 
suppressed  carrier  signals  to  the  tap  port  for  detection  and  label 
recovery.  The  pass-through  port  forward  the  baseband  packet 
with  erased  label  to  a  wavelength  conversion  stage.  The  filter 
peaks  are  designed  with  a  period  to  match  the  ITU  grid.  The 
filter  employs  two  bireftingent  elements  of  equal  length  oriented 
with  principal  axis  at  45°  to  one  another  and  a  half  wave  plate 
with  principal  axes  oriented  at  45°  with  respect  to  the  nearest 
bireftingent  element.  The  use  of  two  bireftingent  elements  of 
equal  length  in  series  produces  a  wide  passband  for  transmis¬ 
sion  between  the  input  port  and  the  through  port  and  a  narrow 
passband  for  transmission  between  the  input  port  and  the  OSCM 
extraction  port.  The  filter  can  be  implemented  entirely  in  fiber 
using  bireftingent  fiber  and  polarization  controllers  or  it  may  be 
implemented  using  bireftingent  crystals.  The  transmission  spec¬ 
trum  of  an  all-fiber  version  of  the  filter  fabricated  at  UCSB  is 
shown  in  Fig.  10(b)  and  was  measured  using  a  broadband  light 
source  and  a  high-resolution  optical  spectrum  analyzer. 

C.  Wavelength  Agile  Sources 

Packet  wavelength  conversion  critically  depends  on  the 
switching  speed  of  a  tunable  laser  used  as  the  local  source 
for  the  new  wavelength.  New  multi-element  widely  tunable 
lasers  with  the  capability  of  tuning  over  hundreds  of  channels 
represent  the  current  state  of  the  art  in  semiconductor  laser 
development.  There  is  a  wide  variety  of  tunable  laser  designs 
and  they  all  have  a  number  of  common  characteristics.  They 
have  active  sections  for  gain  integrated  with  passive  waveguide 
elements  that  are  used  for  wavelength  control,  they  employ 
some  form  of  grating  architecture  for  wavelength  discrimina¬ 
tion,  and  all  use  carrier  injection  for  tuning.  In  this  paper  we 
describe  three  of  the  most  promising  candidates  to  be  used  as 
wavelength  agile  sources — Sampled  Grating  DBR  (SGDBR), 
Super  Structure  Grating  DBR  (SSGDBR)  and  Grating  assisted 
co-directional  Coupler  with  Sampled  Reflector  (GCSR).  The 
second  part  of  this  section  will  describe  recent  advances  in  the 
fast  switching  of  a  GCSR  laser. 

1)  Wavelength  Agile  Laser  Structures:  The  Sampled 
Grating  DBR  (SGDBR)  lasers  were  the  first  ones  among  the 
mentioned  widely  tunable  laser  structures  [28].  The  SGDBR 
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Fig.  10.  (a)  Schematic  of  an  all-fiber  birefringent  loop  mirror  filet  (LMF).  (b)  Transmission  spectrum  of  an  all-fiber  version  of  the  filter. 


is  similar  in  structure  to  a  DBR  laser  except  that  it  employs 
a  Vernier  tuning  mechanism  to  increase  its  tuning  range.  The 
SGDBR  uses  a  pair  of  grating  mirrors  at  either  end  of  the  cavity 
[Fig.  11(a)].  The  gratings  in  the  mirrors  are  periodically  sam¬ 
pled  which  results  in  a  sequence  of  equally  spaced  short  grating 
bursts.  These  give  the  mirrors  a  comb  like  reflection  spectrum 
with  multiple  equally  spaced  peaks.  Different  sampling  periods 
are  used  in  the  front  and  back  mirror  to  give  them  different  peak 
spacing.  This  allows  only  a  single  pair  of  peaks  to  be  aligned 
concurrently.  The  laser  operates  at  the  wavelength  where  the 
peaks  from  the  front  and  back  mirrors  are  aligned  [Fig.  1 1(b)]. 

The  Super  Structure  Grating  DBR  (SSGDBR)  is  a  modifica¬ 
tion  of  the  SGDBR  design  and  is  only  different  by  the  design  of 
the  mirrors  [29],  [30].  In  the  SSGDBR,  the  grating  in  the  mir¬ 
rors  are  periodically  chirped  instead  of  sampled.  The  advantage 
of  this  approach  is  that  the  grating  occupies  the  entire  length 
of  the  mirror  so  that  a  much  higher  reflectivity  can  be  achieved 
with  a  lower  coupling  constant  in  the  grating.  The  other  main 
advantage  is  that  the  reflectivity  of  the  individual  peaks  can  be 
tailored  such  that  all  of  the  reflection  peaks  have  the  same  mag¬ 
nitude. 

The  Grating  assisted  co-directional  Coupler  with  Sampled 
Reflector  (GCSR)  [31],  [32]  device  as  the  name  indicated 
uses  a  grating  assisted  co-directional  coupler  within  the  laser 
as  a  frequency  discriminator  [Fig.  11(c)].  The  wavelength 
selection  is  illustrated  in  Fig.  11(d).  The  coupler  filter  is  tuned 
such  that  one  peak  of  the  comb  generated  by  the  Sampled 


Reflector  coincides  with  the  maximum  transmission  of  the 
coupler.  The  resulting  reflectivity  is  a  main  peal  surrounded 
by  lower  side  peaks.  The  coarse  tuning  is  obtained  when 
only  the  coupler  current  is  changed.  The  lasing  wavelength 
is  tuned  in  large  steps  corresponding  to  the  reflector  peak 
separation.  Depending  on  the  device  the  tuning  range  extends 
between  50  and  114  nm.  The  medium  tuning  occurs  when 
both  coupler  and  reflector  are  tuned  at  the  same  rate  such  as 
the  same  peak  is  selected  all  the  time.  The  medium  tuning 
range  is  between  5  and  15  nm,  similar  to  the  tuning  range  of 
a  DBR  laser.  The  fine  tuning  occurs  when  the  phase  cuixent  is 
changed.  The  lasing  wavelength  is  tuned  continuously  over  a 
cavity  mode  spacing.  By  combining  the  three  types  of  tuning 
it  is  possible  to  obtain  a  truly  continuous  tuning,where  any 
wavelength  can  be  achieved. 

2)  Fast  Switching  Wavelength  Agile  Transmitters:  By  using 
proper  driving  circuitry  and  conditions  it  is  possible  to  obtain 
fast  (on  the  order  of  several  nanoseconds)  switching  between 
ITU  grid  lasing  modes  over  a  broad  range  of  wavelengths.  We 
have  demonstrated  fast  switching  of  the  GCSR  laser  under  dig¬ 
ital  control.  Two  arrays  of  HBT-based  laser  drivers  were  used  to 
obtain  different  levels  of  output  current  that  were  used  to  drive 
coupler  and  reflector  tuning  sections,  thus  making  it  possible 
to  achieve  different  lasing  wavelengths.  Schematic  of  this  setup 
is  shown  in  Fig.  12(a).  If  n  laser  drivers  are  used  for  both  cou¬ 
pler  and  reflector  sections,  2x2n  different  channels  may  be  ac¬ 
cessed.  Fig.  12(b)  shows  measured  switching  time  for  36  chan- 
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Fig.  11.  SGDBR  laser  structure. 

nels  located  on  the  ITU  frequency  grid.  The  longest  switching 
time  is  less  than  4.5  ns. 

D .  Electroabsorption  Modulators  for  40  Gb/s  Packet  Detection 

High-speed  demultiplexing  is  one  of  the  key  technology  in 
high  bit  rate  time-division-multiplexed  transmission  systems. 
Demultiplexing  based  on  high-speed  integrated  circuit  tech¬ 
nology  has  been  demonstrated  at  60  Gb/s  [33];  however,  it  has 
been  limited  to  40  Gb/s  in  fiber  transmission  experiments.  On 
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Fig.  12.  (a)  Schematic  of  fast  switching  using  a  GCSR  laser  and  (b)  measured 

switching  times  for  36  ITU  frequency  channels. 


the  other  hand,  optical  demultiplexing  using  a  sinusoidally 
driven  electroabsorption  modulator  (EAM)  has  emerged  as  a 
simple  alternative  approach  to  electrical  demultiplexing  [34], 
with  demonstrations  at  bit  rates  up  to  160  Gb/s  [35].  Due  to  its 
nonlinear  attenuation  characteristic,  a  highly  reverse  biased  EA 
modulator  is  capable  of  producing  a  short  switching  window 
with  high  extinction  ratio  for  selecting  the  desired  channel  from 
the  incoming  data  stream. 

Optical  demultiplexing  based  on  EAMs  fabricated  at  UCSB 
was  recently  deployed  in  the  40  Gb/s  wavelength  routing  exper¬ 
iment  [8].  The  400-^m  long  EA  modulator  is  based  on  a  trav¬ 
eling-wave  electrode  structure  with  ten  periods  of  strain-com¬ 
pensated  InGaAsP  quantum  wells  fabricated  by  MOCVD  on 
semi-insulating  InP  substrate  [36],  as  shown  in  Fig.  13.  The  de¬ 
vice  achieved  a  maximum  extinction  of  38  dB  while  its  modu¬ 
lation  bandwidth  was  16  GHz.  The  400-mm  EA  modulator  was 
reverse  biased  at  -4.8  V  and  driven  by  a  6  Vpp  10  GHz  RF 
signal  to  generate  a  14-ps  switching  window,  which  was  syn¬ 
chronized  to  the  desired  40  Gb/s  optical  channel  by  an  electrical 
delay  line.  The  demultiplexed  optical  channel  was  then  optically 
amplified  and  detected  by  a  10-GHz  receiver. 
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Fig.  13.  High  frequency  EAM  used  for  40  Gb/s  optical  demultiplexing  and  (a) 
40  Gb/s  input  and  (b)— (e)  demultiplexed  outputs. 


UCSB  overcomes  some  of  the  disadvantages  of  the  differen¬ 
tially  driven  Mach-Zehnder  transmitter  [14].  This  design  had, 
however,  increased  complexity  and  requires  stability  issues  to 
be  addressed.  A  schematic  diagram  of  this  transmitter  is  shown 
in  Fig.  15(a).  In  this  transmitter  the  baseband  data  and  the  sub¬ 
carrier  signal  are  impressed  on  the  same  optical  carrier  inde¬ 
pendently  using  two  separate  external  modulators.  The  separate 
signals  are  then  combined  using  an  optical  multiplexer.  Because 
the  signals  are  generated  separately  they  can  be  optimized  inde¬ 
pendently  and  their  relative  amplitudes  controlled  using  optical 
attenuation  prior  to  the  multiplexer.  In  order  to  avoid  coherent 
interference  between  the  signals,  filtering  must  be  employed  to 
remove  the  carrier  from  one  of  the  signals.  In  addition,  the  sub¬ 
carrier  signal  must  be  out-of-band  relative  to  the  baseband  data 


signal  or  filtering  of  the  baseband  signal  to  remove  spectral  con¬ 
tent  of  at  the  subcarrier  frequency  must  be  done.  In  our  method 
both  the  multiplexing  and  the  required  filtering  is  achieved  using 
a  birefiingent  loop  mirror  filter  shown  in  Fig.  15(b). 

The  principal  advantage  of  this  transmitter  is  that  the  strength 
of  the  subcamer  signal  relative  to  the  baseband  signal  may  be 
easily  controlled  and  that  there  is  no  subcarrier  amplitude  de¬ 
pendent  penalty  introduced  in  the  baseband  data.  In  addition, 
both  the  chirp  of  the  subcarrier  signal  and  the  baseband  data 
signal  may  be  controlled  independently.  The  primary  disadvan¬ 
tage  of  this  technique  is  that  the  filter  spectral  response  must  be 
stable  relative  to  the  laser  wavelength  necessitating  tight  tem¬ 
perature  control  of  the  birefiingent  elements  in  the  filter  or  some 
type  of  closed  loop  locking  arrangement. 

2)  Time  Multiplexed  Transmitters:  For  low  bit  rate  transmit¬ 
ters,  10  Gb/s  and  below,  the  label  and  the  payload  will  prob- 
Fig.  14.  Optical  subcanier  multiplexed  transmitter  based  on  differential  abty  have  the  same  data  format  and  the  whole  packet  can  be 
external  modulator.  encoded  using  conventional  modulation  techniques  for  either 

NRZ  or  RZ  data  format.  For  high  bit  rate  transmitters,  beyond 

E.  Packet  Transmitters  40  Gb/ s’  “  wil1  most  be  necessary  to  use  OTDM  transmit- 

.  ters  where  several  low  bit  rate  data  streams  are  time  multiplexed 

Generation  of  packets  requires  compatible  coding  of  the  pay-  together.  The  label  is  attached  to  the  packet,  at  a  lower  bit  rate  to 
oad  and  label  witii  the  label  swapping  hardware  within  the  net-  allow  lower  cost  label  processing  electronics  to  be  used.  Fig.  1 6 

wor  .  n  this  section  we  describe  transmitters  that  can  generate  shows  a  possible  implementation  of  such  transmitter  generating 
packets  with  subcamer  or  serial  labels.  a  40  Gb/s  IP-packet  with  a  2.5  Gb/s  attached  time  domain  label 

.  .  ,  Transmitters:  The  ideal  OSCM  transmitter  is  one  The  incoming  low  bit  rate  IP  packet  is  demultiplexed  and  stored 

that  is  simple,  robust,  permits  an  arbitrary  subcamer  to  carrier  in  an  electronic  4  x  JV  memory  bank  that  effectively  acts  as 
modulation  ratio,  and  does  not  introduce  penalty  into  either  the  a  serial  to  parallel  converter.  This  memoiy  bank  can  be  made 
baseband  data  or  the  subcamer  signal.  Itis  also  desirable  that  the  of  fast  electronic  memory  arranged  in  a  way  that  it  is  possible 
introduction  of  chiip  in  the  baseband  and  subcamer  signals  be  fill  the  memoiy  at  the  incoming  bit  rate  and  to  read  the  four 
avoided  or  controlled.  In  this  section  we  describe  two  techniques  memory  banks  in  parallel.  Once  the  whole  packet  is  stored,  the 
for  the  generauon  of  hybrid  baseband  and  optically  subcanier  four  rows  are  read  in  parallel  at  10  Gb/s.  Since  in  practice  elec- 

SlgnalS  11131  h3Ve  been  employed  at  UCSB-  tronic  memory  can  not  be  read  at  10  GHz,  multiple  memory 

The  first  technique  is  the  simplest  to  implement  and  most  banks  operating  at  the  speed  of  the  memory  have  to  be  multi- 
robust.  In  this  technique  a  differential  Mach-Zehnder  external  plexed  together  constituting  a  memory  read  at  10  GHz  Each  10 
modulator  is  used  to  impress  both  the  baseband  data  and  the  Gb/s  electrical  channel  subsequently  drives  an  optical  RZ  trans- 
subcamer  signal  onto  the  optical  earner  simultaneously.  Details  mitter  the  gives  10  Gb/s  RZ  data  with  about  10  ps  pulses,  e.g.,  an 
of  the  tradeoffs  for  this  type  of  transmitter  are  given  in  [1 1].  A  integrated  transmitter  consisting  of  a  DFB  laser  with  two  inte- 
schematic  diagram  of  this  transmitter  is  shown  in  Fig.  14.  Only  grated  EAMs.  The  relative  timing  between  the  10  Gb/s  channels 
a  single  Mach-Zehnder  modulator  is  employed  to  impress  both  are  adjusted  to  be  25  ps  between  each  channel  by  using  optical 
the  subcamer  signal  as  well  as  the  baseband  data  onto  the  op-  delays.  Finally  the  four  channels  are  interleaved  in  a  fiber  cou- 
fical  earner.  One  port  of  the  Mach-Zehnder  is  driven  with  the  pier  to  generate  a  40  Gb/s  data  sequence.  The  label  is  generated 
baseband  signal  and  the  other  port  receives  the  subcanier  signal,  simply  by  encoding  the  label  data  at  2.5  Gb/s  NRZ  on  light  with 
e  second  transmitter  architecture  that  has  been  employed  at  the  same  wavelength  as  the  RZ  sources  and  then  attached  to  the 
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Fig.  15.  Optically  multiplexed  SCM  transmitter. 
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Fig.  16.  Implementation  of  a  high-speed  packet  serial  label  transminer. 


40  Gb/s  packet  after  proper  adjustment  of  the  relative  timing. 
Since  all  optical  sources  emit  light  at  the  same  wavelength,  it 
is  in  practice  difficult  to  achieve  the  extinction  ratio  (ER)  re¬ 
quired  from  the  sources  when  being  in  off-state.  Therefore  an 
optical  switch  is  used  at  the  output  to  avoid  coherent  crosstalk 
between  the  label  source  and  the  RZ  data.  This  scheme  can  be 
extended  to  higher  bit  rates  simply  by  increasing  the  number 
of  rows  in  the  serial  to  parallel  converting  memory  and  by  de¬ 
creasing  the  output  pulse  width  from  the  RZ  transmitters.  The 
RZ  transmitters  can  either  be  four  individual  transmitters  con¬ 


stituted  by  DFB  lasers  with  integrated  electroabsorption  modu¬ 
lators,  or,  if  very  short  pulses  are  required,  one  common  pulse 
source,  e.g.,  a  fiber  ring  laser,  can  be  shared  by  four  different 
data  encoding  modulators. 

VII.  AOLS  Subsystem  Implementations 

In  this  section  we  review  subsystems  that  realize  the  func¬ 
tions  described  in  Fig.  4  using  the  technologies  in  the  previous 
section. 
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Fig.  17.  (a)  Two-stage  AOLS  wavelength  converter  using  cascaded  SOA  XGM  and  XPM  modules  and  (b)  second  stage  XPM-WC. 


A.  Two-Stage  SOA  Wavelength  Converters 
with  Subcarrier  Labels 

We  have  implemented  and  demonstrated  an  AOLS  wave¬ 
length  converter  using  a  two-stage  SOA  wavelength  converter 
module  [37],  [38]  shown  in  Fig.  17(a).  The  two-stage  geometry 
is  designed  to  address  key  issues  in  wavelength  conversion  and 
optical  label  swapping.  The  XGM-WC  stage  erases  the  SCM 
header  using  the  inherent  low-pass  filtering  function  of  XGM 
wavelength  conversion  in  SOAs,  as  described  in  [38].  It  is  also 
used  to  convert  packets  from  an  arbitrary  network  wavelength 
to  a  fixed  internal  wavelength  allowing  the  use  of  fixed  optical 
band  pass  filters  (BPF)  to  pass  only  the  internal  wavelength 
and  reject  the  original  wavelength.  Another  key  benefit  of  the 
two-stage  geometry  is  that  the  first  stage  converts  an  arbitrary 
polarization  at  the  input  to  a  fixed  polarization  at  the  output 
for  the  second  stage  converter,  making  it  easier  to  fabricate  the 
XPM-WC  technology.  The  first  stage  also  reduces  the  packet 
optical  power  dynamic  range  and  sets  the  packet  average  optical 
power  level  for  the  XPM-WC.  The  inverting  operation  of  the 
XPM-WC  is  used  in  the.second  stage,  as  shown  in  Fig.  17(b) 
to  provide  noninverting  operation  after  both  stages.  The  highly 
nonlinear  transfer  function  of  the  interferometric  wavelength 
converter  in  the  second  stage  is  used  to  enhance  the  optical 
extinction  ratio  (2R  regeneration)  and  convert  the  packet  to  an 
outgoing  wavelength  with  a  fixed  band  reject  filter  (BRF)  that 
passes  only  the  new  tunable  wavelengths.  A  fast  wavelength 
tunable  laser  is  utilized  to  enable  packet-rate  wavelength  con¬ 
version.  The  new  label  is  pre-modulated  onto  the  fast  tunable 
laser.  As  part  of  our  NGI  program  we  are  leveraging  recent 
developments  in  the  field  of  photonic  integrated  circuits  (PICs) 
to  realize  two-stage  AOLS  technology  through  integration  of 
fast  widely  tunable  lasers  with  SOAs,  waveguides  and  on-chip 
wavelength  monitors. 


B.  Fiber  Loop  Mirrors  with  Subcarrier  Labels  and 
Two-Stage  SOA-WC 

The  fiber  loop  mirror  (FLM)  module  provides  label  removal 
and  a  tap  for  SCM  labels  that  is  immune  to  fiber  dispersion  and 
utilizes  a  simple  detector  for  label  recovery.  The  packet  without 
label  is  passed  to  the  through  port  and  may  be  injected  into  a 
two-stage  AOLS-WC  based  on  SOA  technology  as  shown  in 
Fig.  18.  This  architecture  combines  very  high  suppression  of  the 
original  label  over  the  complete  ITU  grid  with  the  wavelength 
conversion  and  label  rewriting  properties  of  the  integrated  two- 
stage  wavelength  converters. 

C.  Fiber  Wavelength  Converters  and  Serial  Labels 

Optical  label  swapping  of  an  attached  time  domain  label  to  a 
packet  can  be  implemented  using  the  inherent  nonlinear  transfer 
function  and  differentiating  nature  of  fiber  XPM  wavelength 
converter.  The  transmission  function  of  the  wavelength  con¬ 
verter  is  highly  nonlinear,  which  also  depends  on  the  derivative 
of  the  input  signal.  By  using  different  data  formats,  bit  rates, 
and  power  levels  for  the  label  and  the  payload,  while  keeping 
the  energy  per  bit  constant,  it  is  possible  to  remove  the  label 
upon  wavelength  conversion. 

A  particular  scheme  that  has  been  investigated  is  where  the 
payload  is  in  RZ  format  and  the  header  is  in  NRZ  format.  This 
allows  efficient  use  of  bandwidth  for  the  payload  while  retaining 
simplicity  in  handling  the  labels.  If,  for  example,  10  ps  pulses 
are  used  for  a  40  Gb/s  payload  and  2.5  Gb/s  NRZ  data  and  if 
equal  energy  per  received  bit  is  assumed,  the  peak  power  of 
the  labels  will  be  16  dB  lower  than  the  peak  power  of  the  pay- 
load.  Thus  the  transmission  through  the  wavelength  converter 
will  be  extremely  low.  In  addition  the  transmission  through  the 
wavelength  converter  depends  on  the  derivative  of  the  input 
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Fig.  18.  AOLS  module  using  FLM  to  remove  SCM  label  and  two-stage  AOLS-WC  to  convert  packet  and  rewrite  label. 


Fig.  19.  Packets  of  2.5  Gb/s  NRZ  data  and  10  Gb/s  RZ  data  (a)  before  and  (b)  after  wavelength  conversion. 


signal,  why  only  transitions  in  the  NRZ  data  will  experience 
some  transmission.  All  energy  in  the  input  data  with  zero  deriva¬ 
tive  will  be  blocked  in  the  wavelength  converter,  i.e.  most  energy 
in  NRZ  data  will  not  drive  the  wavelength  converter.  After  re¬ 
moving  the  incoming  label  a  new  label  can  easily  be  inserted 
either  by  adding  a  new  label  from  a  separate  source  after  the 
wavelength  converter  or  simply  by  premodulating  the  new  label 
on  the  CW  light  that  determines  the  output  wavelength. 

To  demonstrate  the  concept  of  label  replacement,  a  system 
similar  to  that  in  Fig.  8  was  used.  Packets  of  10  Gb/s  RZ  data  was 
mixed  with  2.5  Gb/s  NRZ  as  shown  in  Fig.  19(a).  The  frequency 
deviation  that  generates  the  optical  side  bands  of  the  CW  light 
will  depend  on  the  derivative  of  the  input  pulse  power,  which 
is  turned  into  amplitude  modulation  after  filtering.  Due  to  this 
differentiating  nature,  the  rise  and  fall  times  determine  the  con¬ 
version  efficiency,  which  means  that  the  wavelength  converter 
will  block  low  frequencies.  In  Fig.  19(b)  it  can  clearly  be  seen 
how  the  10  Gb/s  RZ  data  is  converted  and  the  2.5  Gb/s  NRZ 
data  is  suppressed.  It  should  be  noted  that  NRZ  data  can  not 
be  converted,  as  only  the  edges  of  a  signal  are  converted.  In  a 
sequence  of  consecutive  ONEs,  only  the  first  ONE  would  gen¬ 
erate  an  output  signal.  However,  there  would  still  be  some  output 
that  could  cause  crosstalk  when  a  new  header  is  inserted,  even 
though  the  old  header  data  has  been  corrupted.  For  RZ  data,  the 
pulse  width,  and  not  the  bit  rate  will  determine  the  conversion  ef¬ 


ficiency.  In  another  experiment,  the  10  Gb/s  RZ  data  and  the  2.5 
Gb/s  NRZ  data  were  switched  manually,  to  measure  the  optical 
spectra,  and  to  measure  the  penalty  for  rewriting  new  2.5  Gb/s 
data  where  the  original  data  was  erased.  At  the  input  to  the  wave¬ 
length  converter,  the  average  power  of  the  NRZ  and  the  RZ  data 
were  equal  and  after  removal  of  the  original  NRZ  data  a  new  2.5 
Gb/s  NRZ  data  was  inserted.  The  bit-error  rate  was  measured 
for  the  original  and  rewritten  data  using  231  —  1  pseudo-random 
data  and  no  penalty  was  observed  due  to  the  original  data  in  the 
rewritten  2.5  Gb/s  NRZ  data.  From  the  measurements  it  is  clear 
that  this  type  of  wavelength  converter  can  be  used  to  passively 
erase  a  2.5  Gb/s  NRZ  header  from  a  high-speed  RZ  payload. 
Compared  to  other  approaches  using  time-domain  header  [8], 
[39],  no  timing  control  is  required  to  erase  the  header.  New  2.5 
Gb/s  data  can  then  be  rewritten  in  different  ways.  Here  a  sepa¬ 
rate  transmitter  was  used  for  the  new  data,  which  showed  now 
crosstalk  penalty  from  the  previously  erased  data.  This  makes 
the  system  more  stable  compared  to  the  approach  of  premod¬ 
ulating  and  slightly  detuning  the  local  CW  light  in  the  wave¬ 
length  converter,  but  requires  a  high  extinction  ratio  of  the  new 
header  source  when  the  payload  is  present.  A  third  option,  which 
has  not  been  experimentally  investigated,  is  to  encode  the  new 
header  by  frequency  modulating  the  local  CW  laser  in  the  wave¬ 
length  converter.  With  the  present  tuning  speed  of  about  5  ns  this 
would  limit  the  bit  rate  to  a  maximum  of  100  Mbit/s,  but  with 
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Fig.  20.  Experimental  setup  of  the  packet  routing  experiment.  MOD:  LiNb03  modulator.  MUX:  Passive  10-40  Gb/s  multiplexer:  EDFA-  Erbium-doped  fiber 

^LSpe?“f  Shifted  fiben  ^  Polarization-n>aintaming  fiber  PC:  Polarization  controller;  GCSR:  Grating  coupled  sampled  rear  reflector  laser; 
EAM  DEMUX:  Demultiplexer  using  an  electro-absorption  modulator  BERT:  Bit-error-rate  test  set. 


further  advances  in  tunable  laser  technology  this  could  be  an  in- 
teresting  option. 

Vin.  Experimental  Demonstrations 

A.  40  Gb/s  AOLS  Using  Fiber  XPM  Wavelength  Converter 

To  demonstrate  AOLS  at  high  bit  rates  an  experiment  was  per¬ 
formed  where  packets  of  40  Gb/s  data  was  routed  to  different 
wavelengths  using  a  wavelength  converter.  The  40  Gb/s  pay- 
load  was  in  RZ  format  and  an  attached  time  domain  label  was 
encoded  using  NRZ  at  2.5  Gb/s.  By  using  a  fast  tunable  laser 
each  packet  can  be  routed  to  different  wavelengths.  The  attached 
label  is  also  replaced  upon  wavelength  conversion  by  using  the 
technique  described  in  Section  VII-C  above. 

Fig.  20  shows  the  experimental  set-up.  The  packet  generator 
consisted  of  an  actively  mode-locked  fiber  ring  laser  generating 
10  ps  at  1536  nm  followed  by  a  LiNb03  modulator  encoding  10 
Gb/s  data.  The  10  Gb/s  data  were  injected  into  a  passive  10  to 
40  Gb/s  multiplexer  and  an  acousto-optical  modulator  (AOM) 
gated  out  a  2.5  jjs  payload  that  was  combined  with  a  2.5  Gb/s 
500  ns  long  label.  The  label  was  aligned  in  front  of  the  pay- 
load  with  100  ns  guard  band  determined  by  the  100  ns  rise  time 
in  the  AOM.  A  100  ns  guard  band  was  inserted  between  each 
packet,  giving  a  total  packet  length  of  3.2  /xs.  The  packets  were 
then  injected  into  the  wavelength  converter  which  consisted  of 
an  erbium-doped  fiber  amplifier  (EDFA)  followed  by  5  km  dis¬ 
persion-shifted  fiber  (DSF)  with  a  zero-dispersion  wavelength 


Fig.  21.  (a)  Packets  and  (c)  40  Gb/s  eye-patterns  before  wavelength  routing, 
(b)  Packets  routed  to  1538  nm  and  1543  nm  and  (d)  eye  pattern  of  routed  packet 
at  1543  nm. 


of  1542  nm.  A  fast  tunable  laser  [31]  that  could  be  tuned  to  ei¬ 
ther  1538  or  1543  nm  within  5  ns,  determined  the  new  wave¬ 
length.  This  laser  was  also  used  to  encode  the  new  2.5  Gb/s 
label  before  entering  the  wavelength  converter.  After  the  DSF, 
a  loop  mirror  filter  was  used  to  suppress  the  original  CW  light. 
A  second  filter  was  used  to  select  one  of  the  two  sidebands  and 
to  suppress  the  original  data.  The  use  of  only  one  sideband  re¬ 
tained  the  pulse  width  and  time  bandwidth  product  (TBP)  from 
the  input  pulse.  The  40  Gb/s  receiver  consisted  of  a  40  Gb/s 
to  10  Gb/s  demultiplexer  followed  by  a  10  Gb/s  preamplified 
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Fig.  22.  Subcarrier  multiplexed  AOL  experiment  with  two-stage  SOA-based  label  swapping  and  wavelength  conversion. 


receiver.  The  2.5  Gb/s  label  receiver  consisted  of  an  AOM  to 
remove  the  payload,  which  otherwise  would  dominate  the  mea¬ 
sured  optical  average  power,  followed  by  an  optically  pream¬ 
plified  receiver.  The  BER  measurements  were  made  on  both 
payload  and  label,  and  gated  to  select  appropriate  time  interval. 
Fig.  21(a)  shows  the  input  packets  consisting  of  the  40  Gb/s  RZ 
payload  and  the  2.5  Gb/s  NRZ  label  with  about  10  dB  less  peak 
power.  In  the  lower  trace  the  labels  without  the  payload  is  re¬ 
ported.  Fig.  21(b)  shows  every  other  packet  routed  to  1538  nm 
and  1543  nm  with  new  inserted  labels.  The  eye  patterns  shown 
in  Fig.  21(c)  and  (d)  was  taken  by  zooming  into  the  payload,  be¬ 
fore  (1536  nm)  and  after  wavelength  converter  (1543  nm).  BER 
measurements  were  performed  on  both  label  and  payload  on  in¬ 
coming  and  outgoing  packets  and  less  than  4  dB  penalty  at  a 
BER  =  10-9  was  observed  for  all  routed  packets.  The  power 
penalty  at  a  BER  =  1CT9  for  the  replaced  labels  was  less  than 
2.5  dB.  For  wavelength  routing  at  10  Gb/s,  the  penalty  was  less 
than  1  dB  for  the  payload,  which  indicates  that  a  major  part  of 
the  penalty  is  due  to  instabilities  in  the  40  Gb/s  data. 

B.  2.5  Gb/s  AOLS  Using  Two-Stage  SOA  Wavelength 
Converters 

The  WDM  AOLS  experiment  that  utilized  a  two-stage 
wavelength  converter  is  shown  in  Fig.  22  [3],  [4].  An  OSCM 
packet  transmitter  was  used  to  generate  a  150  Mbps  label  on  an 
RZ  coded,  ASK  modulated  16  GHz  subcarrier.  Labels  consist 
of  a  16-bit  preamble,  an  84-bit  tag  and  a  4-bit  framing  sequence. 
The  payload  is  an  NRZ  coded,  2.5  Gb/s  PRBS  and  the  16 
GHz  subcarrier  supports  payload  bit  rates  out  to  10  Gb/s.  The 
packet  duration  is  1  ms.  Label  clock  and  data  are  recovered  on 
a  packet-by-packet  basis  following  a  10%  fiber  tap,  an  EDFA 
and  an  OSCM  direct-detection  receiver.  The  OSCM  receiver 
utilizes  a  fast  Schottky  barrier  diode  for  envelope  detection. 
A  SAW  filter  is  used  to  recover  the  tag  clock  for  each  packet 
and  a  fixed  digital  delay  is  needed  to  realign  data  and  clock.  A 
tag  switching  processor  is  required  to  perform  serial-to-parallel 
conversion,  compute  a  new  label,  multiplex  the  new  label  onto 
a  RF  subcarrier  and  set  a  fast  wavelength  tunable  laser  to  the 
new  wavelength  within  12  ns. 

A  two-stage  modified  XGM/IWC  was  used  to  perform  wave¬ 
length  routing,  optical  label  erasure  and  label  rewriting  at  the 


EP  packet  rate.  The  OSCM  label  was  removed  during  XGM  due 
to  the  low  pass  frequency  response  of  wavelength  conversion  in 
an  SOA  [38].  The  XGM-WC  converts  incoming  WDM  packets 
to  a  fixed  internal  wavelength  Aint  that  is  passed  to  the  next 
stage  using  a  fixed  frequency  optical  filter.  The  XGM-WC  is 
also  used  to  set  a  stable  operating  point  for  the  IWC.  One  arm  of 
an  InGaAsP  IWC  is  injected  with  the  optically  filtered  output  of 
the  XGM-WC.  The  output  of  a  rapidly  tunable  4-section  GCSR 
laser  transmitter  is  injected  to  both  arms  of  the  IWC.  We  mea¬ 
sured  the  transmission  BER  for  the  label-switched  payload  with 
an  observed  maximum  4  dB  power  penalty  for  all  four  wave¬ 
lengths.  The  power  penalty  is  expected  to  decrease  when  opti¬ 
mally  designed  wavelength  converters  are  used. 

IX.  Summary 

We  have  presented  research  results  of  the  UCSB  NGI  project 
in  AOLS.  The  AOLS  approach  to  wire-rate  packet  encapsula¬ 
tion,  routing  and  forwarding  directly  in  the  optical  layer  hold 
promise  to  support  terabit-per-second  fiber  capacities  indepen¬ 
dent  of  packet  bit-rate  and  protocol.  Two  primary  optical  label 
coding  techniques,  serial-  and  subcarrier  multiplexed-labels, 
were  described  and  the  functional  blocks  used  to  erase  labels, 
rewrite  labels,  and  convert  packets  to  a  new  wavelength  were 
reviewed.  Issues  involving  burst-mode  packet  routing  in  an 
AOLS  environment  were  discussed.  Results  were  presented 
for  both  fiber  and  semiconductor-based  technologies  that  have 
been  used  to  perform  AOLS  functions.  Key  approaches  to 
transmitting  and  processing  subcarrier  labels  were  described 
based  on  fiber  loop  mirrors  used  to  realize  supressed  carrier, 
subcarrier  label  detectors  that  are  immune  to  fiber  dispersion 
induced  signal  fading.  These  taps  can  also  be  used  to  erase 
subcarrier  labels  in  conjunction  with  semiconductor  optical 
amplifer-based  wavelength  converters  used  to  rewrite  the  labels 
and  convert  the  packets  to  a  new  wavelength.  Current  work 
on  highly  integrated  SOA-based  AOLS  modules  was  also 
described.  For  high  bit-rates,  new  AOLS  all-fiber  wavelength 
converter  subsystems  were  used  to  erase  and  rewrite  2.5  Gb/s 
serial  encoded  labels  and  wavelenth  convert  40  Gb/s  packets. 
We  discussed  how  these  technologies  could  be  combined  to 
realize  AOLS  subsystems.  Key  experimental  system  demon¬ 
stration  results  were  also  presented. 
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All-Optical  Demultiplexing  Using  Fiber 
Cross-Phase  Modulation  (XPM) 
and  Optical  Filtering 

Bengt-Erik  Olsson  and  Daniel  J.  Blumenthal,  Senior  Member,  IEEE 


Abstract — All-optical  demultiplexing  of  80-Gb/s  data  to  10  Gb/s 
is  demonstrated  using  spectral  broadening-induced  by  cross-phase 
modulation  (XPM)  with  subsequent  optical  band-pass  filtering. 
Due  to  the  time  derivative  effect  of  XPM,  the  control  pulsewidth 
can  be  larger  than  the  bit-slot  of  the  incoming  data,  and  still 
give  a  switch  window  suitable  for  demultiplexing.  Operation  at 
10" 12  bit-error  rate  is  demonstrated.  In  principle,  this  approach 
will  scale  to  extremely  high  bit  rates  due  to  the  ultrafast  fiber 
nonlinearities. 

Index  Terms — Cross-phase  modulation,  fiber  nonlinearities, 
optical  demultiplexing,  optical  filtering,  optical  networks,  optical 
time-division  multiplexing,  optical-wavelength  conversion. 


I.  Introduction 

FUTURE  ultrahigh  bit-rate  optical  time  division  mul¬ 
tiplexed  (OTDM)  systems  may  require  all-optical 
demultiplexing  to  down  convert  the  high  bit-rate  data  to,  e.g., 
10  or  40  Gb/s  where  electronic  circuits  can  be  used.  Many 
examples  of  such  demultiplexers  have  been  proposed  over 
the  years.  The  most  common  device  is  the  nonlinear  optical 
loop  mirror  (NOLM)  that  allows  switching  due  to  cross-phase 
modulation  (XPM),  in  either  a  fiber  [1]  or  a  semiconductor 
optical  amplifier  (SOA)  [2].  Another  approach  is  to  use 
four- wave-mixing  (FWM)  in  either  an  SOA  or  a  fiber  [3].  In 
all  these  schemes,  the  switch  window  for  the  demultiplexed 
OTDM  channel  can  usually  not  be  shorter  than  the  pulsewidth 
of  the  control  pulse  that  initiates  the  XPM  or  FWM,  and  thus,  a 
very  short  high-quality  optical  pulse  is  required  [4]. 

In  this  letter,  we  demonstrate  a  new  1550-nm  waveband 
demultiplexer  based  on  XPM  in  a  fiber  that  utilizes  the  time 
derivative  effect  of  XPM-induced  spectral  broadening.  Low 
bit-error-rate  (BER)  operation  (<10“12)  is  demonstrated  with 
80-Gb/s  data  optically  demultiplexed  to  10  Gb/s.  In  principle, 
this  approach  will  scale  to  extremely  high  bit  rates,  in  excess  of 
160  Gb/s,  due  to  the  ultrafast  fiber  nonlinearities.  This  approach 
is  based  on  optically  induced  frequency  shifting  of  the  modu¬ 
lated  input  signal  with  a  control  pulse.  The  leading  edge  of  the 
control  pulse,  generates  a  red  shift  of  the  spectrum  of  the  XPM 
modulated  input  signal,  and  the  trailing  edge  generates  a  blue 
shift.  Optically-induced  frequency  shifting  has  previously  been 
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Fig.  1.  Experimental  setup.  ERFRL:  Erbium-doped  fiber  ring  laser.  MOD: 
LiNb03  modulator.  MUX:  Passive  10-80  Gb/s  multiplexer.  TECL:  Tunable 
external  cavity  laser.  EAM:  Electroabsoiption  modulator.  PC:  Polarization 
controller.  EDFA.  OSC:  10-GHz  microwave  oscillator.  DELAY:  Variable 
electrical  delay.  DSF:  Dispersion-shifted  fiber.  BPF:  Bandpass  filter. 

demonstrated  for  pulses  at  an  980-nm  82-MHz  repetition  rate 
using  a  resonantly-excited  semiconductor  nonlinear  waveguide 
combined  with  an  optical  filter  [5],  however,  demonstration 
at  communications  bit  rates  (10  Gb/s)  and  wavelength  with 
real  data,  was  not  investigated  and  only  single-pulse  operation 
shown. 

Previously,  we  have  reported  wavelength  conversion  using 
this  technique,  where  the  incoming  data  modulates  the  phase  of 
a  continuous  wave  (CW)  signal  with  subsequent  conversion  to 
amplitude  modulation  [6].  In  this  letter,  only  one  of  the  OTDM 
data  channels  in  the  high  bit-rate  data  is  spectrally  broadened, 
and  that  channel  is  extracted  using  a  narrow-band  optical 
band-pass  filter  (BPF)  at  either  side  of  the  original  spectrum. 
Thus,  if  dispersive  walkoff  is  neglected,  only  one  edge  of  the 
control  pulse  governs  the  width  of  the  demultiplexing  switch 
window.  Therefore,  a  control  pulse  broader  than  the  actual 
bit  slot  can  be  used  for  demultiplexing.  Another  important 
feature  of  this  demultiplexer  compared  to  the  interferometric 
techniques  like  the  NOLM,  is  its  insensitivity  to  environmental 
disturbances. 

II.  Experiments 

The  experimental  setup  is  shown  in  Fig.  1.  An  actively 
mode-locked  fiber  ring  laser  generated  6-ps  pulses  at  a 
1538.5-nm  wavelength  with  a  10-GHz  repetition  rate.  10-Gb/s 
PRBS  231  -  1  data  was  encoded  onto  the  fiber  ring  laser  output 
using  an  external  modulator.  A  passive  optical  interleaver 
multiplexer  was  used  to  generate  an  80-Gb/s  data  stream. 
The  control  pulses  were  generated  using  an  electroabsoiption 
modulator  that  output  14-ps  pulses  at  a  wavelength  of  1534 
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Fig.  2.  Optical  spectrum  at  the  input  of  the  DSF,  after  the  DSF,  and  after  the  ~30  *20  -10  0  1 1 0  20  30 

0.2-nm  BPF.  Time  offset  between  pump  and  data  pulses  [ps] 


Fig.  3.  80-Gb/s  eye  pattern,  (a)  Demultiplexed  10-Gb/s  channels  (bMi).  The 
extinction  ratios  of  the  output  pulses  were  estimated  to  be  better  than  15  dB. 


Fig.  4.  Measured  switch  window  of  the  demultiplexer. 

pling  oscilloscope.  Fig.  3(bHj)  shows  the  eight  demultiplexed 
10-Gb/s  channels.  No  crosstalk  was  present  even  though  the  con¬ 
trol  pulsewidth  was  14  ps  and  the  bit-slot  at  80-Gb/s  is  12.5  ps. 
The  extinction  ratios  of  the  output  pulses  were  estimated  to  be 
better  than  15  dB. 

To  estimate  the  actual  switch  window  in  the  device,  the  cross 
correlation  between  a  6-ps  data  pulse  and  a  14-ps  control  pulse 
was  measured.  Fig.  4  shows  the  measured  cross  correlation  be¬ 
tween  the  two  pulses  in  the  demultiplexer.  The  measurement 
was  made  by  sliding  the  14-ps  control  pulses  across  the  6-ps 
data  pulses,  while  recording  the  demultiplexer  output  power 
with  a  power  meter.  The  width  of  the  switch  window  was  1 1  ps, 
even  though  a  14-ps  control  pulse  was  used,  which  is  suitable  to 
demultiplex  80  Gb/s.  We  also  noticed  a  somewhat  rectangular 
shape  of  the  switch  window,  which  is  due  to  dispersive  walkoff 
between  control  and  data  pulse,  and  this  effect  is  useful  to  ab¬ 
sorb  timing  jitter  in  the  incoming  data.  The  net  polarization  de- 


nm.  The  80-Gb/s  data  and  the  control  pulses  were  combined 
in  a  50 : 50  coupler  and  amplified  in  an  erbium-doped  fiber 
amplifier  (EDFA)  to  +18  dBm  average  output  power.  A  5-km 
dispersion  shifted  fiber  (DSF)  with  a  zero  dispersion  wave¬ 
length  of  1543  nm,  was  used  to  induce  XPM  from  the  control 
pulses  on  to  one  of  the  10-Gb/s  data  channels  in  the  80-Gb/s 
data  stream. 

The  optical  spectrum  was  broadened  only  during  the  time  slot 
of  one  10-Gb/s  channel,  and  that  channel  was  extracted  by  using 
a  narrow  0.2-nm  optical  BPF  positioned  at  a  center  wavelength 
of  1538  nm.  The  output  from  the  BPF  was  then  sent  to  an  opti¬ 
cally  preamplified  receiver  for  inspection  on  a  sampling  oscillo¬ 
scope,  as  well  as  for  BER  measurements.  Fig.  2  shows  the  optical 
spectra  in  various  points  of  the  system.  The  solid  trace  shows  the 
spectrum  after  the  50 ;  50  combiner.  The  power  ratio  between  the 
control  pulse  and  the  80-Gb/s  data  was  about  20  dB,  to  avoid  sat¬ 
uration  of  the  EDFA  in  front  of  the  DSF  from  the  incoming  data. 
The  dashed  trace  shows  the  spectrum  after  the  DSF,  where  the 
spectrum  of  the  80-Gb/s  data  is  broadened  due  to  XPM  from  the 
control  pulse.  The  dotted  trace  in  Fig.  2  shows  the  filtered  XPM 
broadened  spectrum,  i.e. ,  the  demultiplexed  channel  at  the  output. 
Since  the  BPF  is  only  0.2  nm,  the  output  pulsewidth  of  the  demul¬ 
tiplexed  data  is  17  ps.  Fig.  3(a)  shows  the  input  80-Gb/s  data  eye 
pattern  as  measured  with  a  40-GHz  detector  on  a  50-GHz  sam- 


pendence  of  the  demultiplexer  was  about  3  dB,  even  though  the 
polarization  dependence  of  XPM  in  standard  DSF  is  5  dB.  The 
reason  for  this,  is  that  for  orthogonal  relative  polarization  of  data 
and  control  pulses,  the  spectral  broadening  is  at  its  minimum, 
and  for  decreased  orthogonality,  the  spectrum  gets  broader,  but 
still  leaving  energy  within  the  filter  band  width.  However,  the 
polarization  dependence  depends  heavily  on  the  available  pump 
power,  as  well  as  the  position  of  the  BPF. 

BER  measurements  of  the  demultiplexed  data  are  shown  in 
Fig.  5.  The  back-to-back  receiver  sensitivity  was  measured  from 
10  Gb/s  to  -36.8  dBm,  after  the  data  modulator.  Full  BER 
plots  are  shown  for  channel  one  to  four,  and  BER  measurements 
around  10~9  are  shown  for  channel  five  to  eight.  All  channels 
performed  almost  the  same,  giving  a  penalty  of  about  2  dB.  This 
penalty  is  believed  to  arise  primarily  due  to  nonoptimal  filtering 
of  the  narrow  bandwidth  demultiplexed  10-Gb/s  data  in  the  re¬ 
ceiver.  The  optically  preamplified  receiver  included  a  0.6-nm 
optical  BPF  that  is  close  to  optimum  for  6-ps  pulses,  and  thus 
gives  a  low  receiver  sensitivity  for  the  original  10-Gb/s  data. 
After  demultiplexing,  the  pulsewidth  is  17  ps  with  a  spectral 
width  of  0.2  nm,  which  is  far  from  optimum  for  this  particular 
receiver.  In  fact,  sending  the  original  10-Gb/s  data  with  6-ps 
pulses  through  the  0.2-nm  filter,  before  entering  the  receiver 
gave  also  about  2-dB  penalty,  but  with  a  slightly  better  slope 
of  the  BER. 
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Fig.  5.  BER  measurements  for  80-10-Gb/s  demultiplexing. 

in.  Conclusion 

A  new  all-optical  demultiplexer  based  on  XPM  in  a  fiber  fol¬ 
lowed  by  a  narrow  optical  BPF  is  described  and  demonstrated. 
An  80-Gb/s  data  channel  was  successfully  demultiplexed  to  10 
Gb/s  with  <2  dB  receiver  penalty  using  a  control  pulse  of  14 
ps.  Due  to  the  derivative  effect  of  XPM,  a  control  pulse  that  is 
wider  than  the  bit-slot  of  the  high  bit-rate  data  can  be  utilized. 
The  demultiplexer  is  insensitive  to  environmental  disturbances, 
and  can  be  made  polarization  independent  by  using  a  diversity 
scheme  [7],  circular  birefringent  fiber  [8],  or  possibly  by  man¬ 


aging  the  input  power  condition.  The  demultiplexer  scheme,  is 
also  well  suited  for  simultaneous  demultiplexing  by  using  both 
slopes  of  the  control  pulse,  and  control  pulses  at  multiple  wave¬ 
lengths,  which  is  subject  to  further  investigation. 
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WDM  to  OTDM  Multiplexing  Using  an  Ultrafast 
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Abstract — A  robust  and  scalable  all-optical  multiplexer  com¬ 
bining  four  10-Gb/s  WDM  channels  into  one  40-Gb/s  OTDM 
channel  is  presented.  The  multiplexer  generates  a  coherent  output 
data  stream,  which  does  not  suffer  from  channel  interference  as 
passively  generated  OTDM  data  do. 

Index  Terms — Fiber  optics,  optical  multiplexing,  optical  net¬ 
works. 


I.  Introduction 

THE  RAPID  increase  in  the  demand  for  more  optical  trans¬ 
mission  bandwidth  has  led  to  a  great  interest  in  increasing 
the  bit  rate  of  individual  data  channels  in  wavelength-division- 
multiplexed  (WDM)  networks.  The  highest  bit  rate  of  single 
channel  transmitters  in  deployed  networks  today  is  10  Gb/s, 
even  though  40-Gb/s  transmitters  are  expected  in  the  near  fu¬ 
ture.  An  alternative  way  to  accomplish  higher  bit  rates  is  to 
optically  time-division-multiplex  (OTDM)  multiple  10-Gb/s  re- 
tum-to-zero  (RZ)  data  streams  together  using  passive-time  in¬ 
terleaving  multiplexers.  The  traditional  method  of  doing  this  is 
by  having  a  10-GHz  clock  source,  usually  a  mode-locked  or 
gain-switched  laser,  generates  short  pulses  and  splits  the  clock 
signal  into  the  desired  number  of  channels,  and  subsequently 
encode  data  on  each  of  them.  All  channels  are  then  time  in¬ 
terleaved  in  optical  combiners  with  appropriate  time  delays  be¬ 
tween  each  channel.  The  main  problem  with  this  method  is  that 
very  high  quality  clock  pulses  are  necessary  to  avoid  interfer¬ 
ence  problems  between  adjacent  channels  [1],  leading  to  a  vari¬ 
ation  in  pulse  amplitude  and,  if  the  pulses  are  not  perfectly 
coherent,  a  time  varying  drift  in  amplitude.  In  practice,  very 
short  pulses,  typically  less  than  one  quarter  of  a  bit  slot  with 
high  extinction  ratio  (>30  dB)  are  required  to  achieve  a  noise¬ 
less  OTDM  data  sequence.  Here,  we  demonstrate  a  multiplexer 
that  takes  four  channels  from  a  WDM  transmitter  with  nonre- 
tum-to-zero  (NRZ)  format  and  convert  them  to  NRZ  format, 
and  subsequently  wavelength  convert  all  four  WDM  channels 
to  one  wavelength  to  obtain  one  OTDM  channel.  The  advan¬ 
tage  of  this  scheme  is  that  existing  WDM  transmitter  technology 
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can  be  used,  while  increasing  the  single  channel  transmission  bit 
rate,  as  well  as  eliminating  some  problems  with  passive  OTDM 
multiplexing.  For  example,  the  long  coherence  length  of  the 
local  laser  determines  the  coherence  length  of  the  output  optical 
data.  WDM  to  time-division-multiplexing  (TDM)  conversion 
has  previously  been  demonstrated  using  semiconductor  optical 
amplifier  (SOA)  technology  [2],  as  well  as  using  the  nonlinear 
optical  loop  minor  (NOLM)  [3].  However,  the  SOA  technology 
has  a  limited  frequency  response  and  even  at  40  Gb/s,  it  is  dif¬ 
ficult  to  obtain  short  enough  pulses  at  the  output  to  constitute  a 
RZ  signal.  The  NOLM  is  in  general,  not  suitable  for  wavelength 
conversion,  since  the  pulses  that  induce  cross-phase  modulation 
(XPM)  have  to  be  very  short  compared  to  the  bit  slot  to  avoid 
crosstalk  due  to  unwanted  XPM  from  the  high  average  power 
[4]. 

In  this  experiment,  we  use  an  ultrafast  wavelength  converter 
based  on  XPM  in  an  optical  fiber  [5],  which  in  principle  can 
allow  operation  at  several  hundred  gigabits  [6].  An  advantage 
with  this  scheme  is  that  the  requirements  on  the  RZ  pulses  are 
lower  due  to  the  nonlinear  transfer  function  of  the  wavelength 
converter,  which  allows  pulses  with  moderate  extinction  ratio  to 
be  used.  The  output  of  the  wavelength  converter  is  given  by  the 
derivative  of  the  input  signal,  which  also  allows  broader  pulses 
to  be  used  at  the  input. 

II.  Experimental  Results  and  Discussion 

The  basic  idea  is  to  demonstrate  conversion  of  4-10  Gb/s 
WDM  channels  to  one  40-Gb/s  OTDM  channel.  Fig.  1  shows 
the  experimental  setup  used  for  the  multiplexer  demonstration. 
Four  tunable  lasers  generated  continuous-wave  (CW)  light  at 
1544.0, 1545.6, 1547.2,  and  1548.8  nm.  10-Gb/s  pseudorandom 
bit  stream  (PRBS)  NRZ  data  with  a  word  length  of  231  —  1  was 
encoded  on  all  wavelengths,  and  adjacent  wavelength  channels 
were  encoded  with  different  data.  The  four  WDM  channels 
were  then  converted  to  RZ  data  by  injecting  the  synchronized 
WDM  channels  into  an  electroabsorption  modulator  (EAM) 
followed  by  a  200-m  dispersion  compensating  fiber  (DCF) 
{D  =  -67  ps/nm  *  km).  With  this  arrangement,  pulses  with  a 
pulsewidth  between  8-9  ps  were  obtained  for  all  four  WDM 
channels.  Care  was  taken  to  equalize  the  output  pulsewidth 
between  the  channels  by  adjusting  the  dc-bias  point  of  the  EAM 
without  leading  to  a  significant  crosstalk  in  the  EAM.  The  total 
loss  in  the  EAM  was  25  dB  at  the  chosen  operating  point,  and  it 
was  a  tradeoff  between  maximum  allowed  input  power  to  avoid 
crosstalk  in  the  EAM,  and  signal-to-noise  ratio  (SNR)  at  the 
output  of  the  following  erbium-doped  fiber  amplifier  (EDFA). 
Here  the  total  average  power  into  the  EAM  was  +9  dBm,  which 
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WDM  transmitter  NRZ-RZ  converter 

r 


Fig.  1.  Experimental  setup.  TECL:  Tunable  external  cavity  laser.  PC:  Polarization  controller;  MOD:  LiXb03  external  modulator.  EDFA:  EAM;  FBG:  DCF; 
DSF:  Dispersion  shifted  fiber.  BPF:  0.4-nm  optical  bandpass  filter. 


gave  rise  to  a  moderate  crosstalk  between  the  WDM  channels. 
The  DCF  after  the  EAM  served  as  a  pulse  compressor,  since 
the  pulses  from  the  EAM  had  some  positive  chirp,  but  also  to 
align  the  four  RZ  data  channels  to  be  delayed  25  ps  between 
each  channel.  By  looking  at  the  four  wavelength  channels 
in  the  time  domain,  e.g.,  with  a  high-speed  oscilloscope, 
the  signal  looked  like  a  40-Gb/s  OTDM  data  stream,  even 
though  each  channel  resides  on  a  different  wavelength.  Since 
the  EDFA  after  the  EAM  added  noise,  a  fiber  Bragg  grating 
(FBG)  was  used  to  remove  amplified  spontaneous  emission 
(ASE)  noise  at  the  wavelength  of  the  output  of  the  wavelength 
converter.  The  WDM  RZ  signals  were  then  combined  with  CW 
light  at  1554.5  nm  and  amplified  in  an  EDFA  to  an  average 
power  of  350  mW.  An  850-m  dispersion  shifted  fiber  (DSF) 
with  a  zero  dispersion  wavelength  of  1550  nm  was  used  to 
impose  a  phase  modulation  onto  the  CW  light  by  XPM  in  the 
fiber.  This  phase  modulation  was  then  converted  to  amplitude 
modulation  by  a  filter  arrangement,  which  consisted  of  a  FBG 
to  remove  the  intense  original  data  channels  signals,  as  well  as 
an  optical  bandpass  filter  to  select  one  of  the  generated  side 
bands  of  the  CW  light  [5].  The  output  40-Gb/s  OTDM  data 
were  investigated  using  a  40-GHz  photo  detector  on  a  50-GHz 
sampling  oscilloscope.  An  EAM  driven  at  10  GHz  was  used  to 
demultiplex  the  40-Gb/s  data  to  10  Gb/s,  where  bit-error-rate 
(BER)  measurements  were  performed. 

Fig.  2  shows  the  optical  spectrum  at  various  points  of  the 
system.  The  solid  line  shows  the  spectrum  into  the  wavelength 
converter  with  the  four  10-Gb/s  RZ  signals  and  the  CW  light, 
the  dashed  line  shows  the  spectrum  after  the  DSF  in  the  wave¬ 
length  converter,  and  the  dotted  line  shows  the  output  signal 
from  the  wavelength  converter.  Fig.  3(a)  shows  the  eye  pattern 
of  the  4  x  10  Gb/s  WDM  channels  after  the  EAM  and  the  DCF. 
Although  each  channel  is  on  its  own  wavelength,  they  now  look 
like  a  40-Gb/s  OTDM  data  stream.  Fig.  3(b)  shows  the  output 
40-Gb/s  OTDM  data  after  the  wavelength  converter,  which  is  a 
clearly  open  40-Gb/s  eye  pattern.  BER  measurements  were  per¬ 
formed  on  the  output  40-Gb/s  data  by  demultiplexing  the  data 
into  four  10-Gb/s  data  channels  and  the  BER  of  all  four  chan¬ 
nels  were  measured  and  are  presented  in  Fig.  4.  Fig.  4  also  shows 
the  back-to-back  measurements  for  one  of  the  original  10-Gb/s 
channels  after  conversion  from  NRZ  to  RZ  without  the  pres¬ 
ence  of  the  other  WDM  channels  in  the  EAM.  A  3-dB  receiver 
penalty  at  a  BER  of  10~9  was  observed  for  all  channels  in  the 


Wavelength  [nm] 


Fig.  2.  Spectrum  of  the  input  signal  to  the  EDFA  in  the  wavelength  converter 
(solid  line),  spectrum  after  the  DSF  (dashed  line),  and  spectrum  of  the  output 
40-Gb/s  OTDM  data  (dotted  line).  The  output  extinction  ratio  is  >20  dBm. 


Fig.  3.  (a)  Eye  patterns  of  the  4-  10-Gb/s  WDM  channels  before  wavelength 
conversion,  (b)  The  output  40-Gb/s  OTDM  data  after  wavelength  conversion. 


OLSSON  et  al.  \  WDM  TO  OTDM  MULTIPLEXING 


1007 


Average  received  optical  power  [dBm] 

Fig.  4.  BER  measurements  of  one  input  10-Gb/s  channel,  back-to-back  (■), 
and  the  four  channels  of  the  output  40-Gb/s  OTDM  data  stream. 


Fig.  5.  10-Gb/s  RZ  data  sent  through  a  passive  KMO-Gb/s  OTDM 

multiplexer.  The  extinction  ratio  of  the  pulses  in  approximately  20  dB 
(10  ps/div). 

output  40-Gb/s  OTDM  data.  This  penalty  is  primarily  due  to 
crosstalk  between  the  WDM  channels  in  the  first  EAM  in  the 
NRZ  to  RZ  conversion  process,  and  also  partly  due  to  the  high 
loss  in  the  EAM,  which  decreases  the  signal-to-noise  ratio 
(SNR)  after  the  following  EDFA.  We  believe  that  an  EAM  with 
less  total  insertion  loss  would  eliminate  this  penalty  since  a 
higher  SNR  would  then  be  maintained  and  the  input  power  into 
the  EAM  can  be  lower  to  reduce  the  crosstalk. 

An  advantage  with  this  wavelength  converter  scheme  is  that 
only  moderate  requirements  on  the  input  RZ-data  channels 
are  required,  since  the  transmission  through  the  wavelength 
converter  is  nonlinear  and  conversion  is  only  intensity  depen¬ 
dent.  This  means  that  the  extinction  ratio  of  the  RZ  pulses  is 


not  critical.  The  output  data  is  now  perfectly  coherent  and  no 
interference  effects  occur  between  adjacent  channels  in  the 
OTDM  pulse  train,  which  can  be  present  in  classic  passive 
OTDM  multiplexers.  To  demonstrate  the  difference  between 
the  presented  multiplexer  and  a  passive  OTDM  multiplexer, 
the  10-Gb/s  RZ-data  of  channel  1  was  sent  into  a  passive 
split-and-interleave  10-40-Gb/s  multiplexer.  Fig.  5  shows  the 
output  40-Gb/s  eye  pattern  of  that  OTDM  multiplexer  and  the 
eyes  are  heavily  distorted  and  unstable  due  to  interference 
between  adjacent  pulses,  which  occurs  due  to  a  limited  ER 
from  the  pulse  source  (ER  20  dB). 

in.  Conclusion 

A  novel  WDM  to  OTDM  multiplexer  using  an  ultrafast 
wavelength  converter  is  presented.  The  multiplexer  allows  high 
quality  conversion  from  four  NRZ  WDM  channels  at  10-Gb/s 
to  one  40-Gb/s  OTDM  channel.  The  3-dB  power  penalty 
observed  in  the  output  40-Gb/s  data  stream  can  be  eliminated  if 
an  EAM  with  lower  loss  or  higher  saturation  power  is  obtained. 
The  system  is  scalable  both  in  terms  of  the  number  of  input 
channels,  only  limited  by  the  bandwidth  of  the  wavelength 
converter  [6],  as  well  as  to  higher  input  bit  rates  in  the  incoming 
WDM  channels. 
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LP01  mode  to  the  cladding  modes  supported  by  the  glass-air  inter¬ 
face.  The  thick  line  is  the  transmission  taken  while  the  fibre  was  in 
index-matching  fluid.  As  can  be  seen,  the  coupling  peaks  for  the 
cladding  modes  in  the  index  matching  fluid  case  are  reduced  due 
to  a  decrease  in  guidance  and  the  peak  wavelengths  move  towards 
longer  wavelength  due  to  an  increase  of  the  modal  effective  refrac¬ 
tive  index.  Meanwhile,  the  peak  for  coupling  into  the  LP11  mode 
is  not  affected  by  the  changing  environment  of  the  fibre,  giving 
strong  evidence  that  this  mode  is  a  core  mode  which  has  virtually 
no  power  distribution  at  the  glass-air  interface  of  the  fibre.  Cou¬ 
pling  into  the  leaky  LP1 1  mode  is  also  very  much  stronger  than 
the  strongest  peak  for  the  cladding  mode  coupling,  ~5  dB  in  this 
case.  Coupling  into  the  leaky  LP1 1  mode  can  be  further  improved 
by  having  a  grating  with  strong  non-circular  symmetry  and  appro¬ 
priate  fibre  design. 


Fig.  4  Temperature  sensitivity  of  long  period  grating  in  depressed  clad¬ 
ding  fibre 

•  data 

- linear  fit,  0.26nm/°C 


A  long  period  grating  was  written  into  the  fibre  used  above  to 
demonstrate  this  method.  The  fibre  was  H2-loaded  before  a  ~2cm 
grating  was  written  by  a  193nm  ArF  excimer  laser  using  an  ampli¬ 
tude  mask.  The  transmission  of  the  fibre  was  monitored  and  then 
translated  into  absorption.  This  is  plotted  in  Fig.  3.  The  tempera¬ 
ture  sensitivity  of  the  grating  is  also  measured  and  plotted  in  Fig. 
4.  The  sensitivity  was  measured  as  0.26nm/°C  for  this  grating. 
This  is  typical  for  long  period  gratings. 

To  summarise,  we  have  demonstrated  a  new  method  for  use  in 
long  period  gratings,  which  has  enhanced  coupling  strength  and  is 
insensitive  to  the  fibre  glass-air  interface. 
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Compact  InGaAsP/InP  1x2  optical  switch  based 
on  carrier  induced  supression  of  modal 
interference 

G.A.  Fish,  L.A.  Coldren  and  S.P.  DenBaars 


Indexing  terms:  Optical  switches.  Photonic  switching 


A  novel,  compact  optical  switch  designed  for  use  with  integrated 
amplifiers  to  create  more  scalable  photonic  crossconnects  is 
reported.  The  basic  operation  of  the  switch  is  described  and 
experimentally  verified.  On/off  ratios  of  16.2dB  (7.5dB)  in  the  bar 
(cross)  state  axe  demonstrated  with  currents  <  30mA  and  a  very 
low  bar  state  excess  loss  of  0.5dB. 

Introduction :  InGaA$P/InP  based  photonic  integrated  cross  con¬ 
nects  are  rapidly  maturing  and  will  be  very  attractive  components 
for  future  optical  networks.  Current  research  in  this  area  can  be 
loosely  devided  into  three  catagories:  purely  passive  switches  such 
as  the  digital  optical  switch  (DOS)  [1]  and  Mach-Zender  interfer¬ 
ometer  (MZI)  [2]  or  directional  coupler  based  switches  [3],  com¬ 
pletely  active  switches  involving  amplifier  gates  [4],  and 
combination  passive  switches  and  amplifying  gates,  such  as  the  5° 
or  COSTA  [5].  The  last  of  these  categories  shows  the  most  prom¬ 
ise  for  large  NxN  crossconnects,  since  it  has  achieved  very  low 
insertion  loss  and  crosstalk  levels,  which  would  be  difficult  to 
obtain  with  purely  passive  switches,  while  maintaining  a  scalability 
which  is  not  possible  in  a  gate  array. 

The  switching  elements  used  in  the  COSTA  are  based  on  carrier 
induced  total  internal  reflection  (TIR)  [6]  and  exhibit  high  switch¬ 
ing  currents  and/or  large  excess  losses  typical  of  these  devices 
[5,  7,  8]  which  may  impose  limitations  on  scalability.  In  this  Let¬ 
ter,  we  present  a  novel  method  for  optical  switching  and  its  appli¬ 
cation  towards  fabricating  compact  switches  suitable  for 
integration  with  amplifiers  in  a  expandable  crossconnect. 


cross 


b 

Fig.  1  Schematic  diagrams  of  SMI  switch 

a  Input  waveguide  coupled  off-centre  to  TMI  region  whose  output  is 
altered  when  current  is  applied 

b  Integration  with  amplifier  and  additional  switch  to  form  single  ele¬ 
ment  of  crossconnect 


Switch  design  and  fabrication:  Switches  for  use  in  conjunction  with 
amplifiers  in  nonbroadcasting  architectures  (e.g.  the  5s)  have 
slightly  different  criteria  than  stand  alone  switching  elements. 
Namely,  the  bar  state  excess  loss  of  the  switch  should  be  made  as 
low  as  possible,  as  most  of  the  switches  encountered  in  a  non¬ 
broadcasting  crossconnect  will  be  in  the  bar  state.  The  cross  state 
loss  is  less  important  and  it  is  only  neccessary  that  the  integrated 
amplifier  can  easily  overcome  it.  The  switching  current  should  also 
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be  as  small  as  possible  to  avoid  heating  and  to  simplify  driver  circuits, 
as  multiple  switch/amplifier  pairs  will  be  operated  simultaneously. 

To  achieve  these  requirements  at  the  same  time  in  an  amplified 
switch,  we  have  developed  the  structure  shown  in  Fig.  la.  which 
we  refer  to  as  a  supressed  modal  interference  (SMI)  switch.  The 
basic  principle  is  novel  and  involves  injecting  current  into  a  multi- 
mode  inteference  section  in  such  a  way  as  to  change  its  imaging 
properties  to  achieve  a  different  output.  To  create  a  1x2  switch,  a 
singlemode  input  waveguide  was  coupled  off  centre  to  a  two-mode 
interference  (TMI)  section  which  is  precisely  one  beat  length  long. 


-10  0  10  20  -10  0  10  20 
lateral  distance,  pm 

Fig.  2  Beam  propagation  method  (. BPM)  simulations  of  SMI  switch  in 
operation 

a  Modal  interference  causes  optical  field  to  bypass  curve  in  bar  state 
b  Suppression  of  interference  with  current  injection  directs  field  into 
curve  in  cross  state 


to  be  observed,  as  shown  in  Fig.  3b.  To  separately  characterise  the 
SMI  switch  without  amplificatiori,  the  curved  amplifier  was 
pumped  as  an  LED  (X  -  1540nm)  and  light  was  coupled  out  of 
the  switch  input  using  a  lensed  (R  =  9pm)  optical  fibre.  The  bar 
state  was  similarly  characterised  using  an  additional  on-chip  LED 
contained  in  this  path.  The  on/off  ratios  of  the  switch  in  the  cross 
and  bar  states  were  measured  as  7.5  and  16.2dB,  respectively,  with 
<  30mA  of  injected  current,  as  illustrated  in  Fig.  3 a.  The  intial  dip 
in  the  contrast  ratio  of  the  cross  state  is  due  to  the  SMI  switch 
being  slightly  longer  than  one  beat  length.  The  light  output  of  the 
bar  state  was  compared  to  that  of  several  straight  waveguides  with 
no  switches  to  give  an  estimate  of  0.5dB  for  the  bar  state  excess 
loss. 
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Fig.  3  Contrast  ratios  and  near  field  image  of  SMI  switch 

a  Contrast  ratios  of  bar  and  cross  states  with  applied  current  showing 
16.2dB  and  7.5dB  contrasts,  respectively,  with  <  30mA 
b  Near  field  image  of  end  of  SMI  switch  of  length  L*  as  current  is 
applied,  showing  supression  of  modal  interference 


Similar  to  2x2  TIR  switches  [7]  of  the  past,  in  the  absence  of  cur¬ 
rent,  the  input  mode  excites  the  modes  of  the  TMI  section  and  its 
image  appears  on  the  opposite  half  of  the  TMI  section  after  trav¬ 
elling  one  beat  length,  as  shown  in  Fig.  2a.  By  injecting  current  on 
the  unimplanted  half  of  the  TMI  section,  its  mode  "spectrum  is 
changed  until  only  a  singlemode  exists,  supressing  modal  interfer¬ 
ence.  and  the  light  is  guided  straight  into  the  curved  amplifier  sec¬ 
tion.  Fig.  2b.  Beam  propagation  method  (BPM)  simulations 
predict  that  as  much  as  70%  of  the  light  can  be  coupled  into  the 
curved  section  while  maintaining  only  0.3dB  of  excess  loss  in  the 
bar  state. 

Combining  the  SMI  switches  with  a  small  radius  amplifier  to 
create  a  very  compact  crossconnect  element,  shown  in  Fig.  \b.  is 
complicated  by  the  different  lateral  index  contrast  required  by  the 
two  elements.  Switches  must  have  small  index  contrasts  to  be 
effective,  while  small  radius  bends  require  large  index  contrasts  to 
reduce  bending  loss.  The  solution  of  this  problem  is  to  etch  addi¬ 
tional  cladding  material  from  the  outside  of  the  curve,  as  illus¬ 
trated  in  Fig.  1  b. 

Fabrication  of  the  amplified  SMI  switch  requires  only  two 
MOCVD  growth  steps  whereby  a  base  structure  with  an  MQW 
active  region  is  grown  on  top  of  0.35pm  thick  1.35  Q  waveguides 
selected  to  give  high  index  change  against  absorption  loss  when 
injecting  carriers  [9].  The  active  region  is  removed  in  the  passive 
sections  of  the  device  and  the  p-cladding  and  contact  layers  are 
regrown  including  a  stop  etch  layer  to  defme  the  lateral  index  con¬ 
trast  of  the  switch  region.  Ridge  waveguides  are  dry-etched  using 
CH/HyAr  followed  by  a  wet  H3P04/HC1  clean  up  etch  to  the  stop 
etch  layer.  This  gives  a  well  controlled,  uniform  waveguide  width 
and  lateral  index  constrast  required  for  these  type  of  devices  [10]. 
Further  etching  on  the  outside  of  the  curve  section  follows  using 
Br/H-,POd/H20.  Finally,  the  devices  are  covered  with  SiN,,  metal¬ 
lised,  and  proton  implanted  to  isolate  the  various  sections  and 
define  the  unbiased  half  of  the  SMI  switch. 

Switch  characteristics:  To  verify  the  BPM  analysis,  the  near  field 
pattern  at  the  end  of  various  length  SMI  switches  was  imaged 
using  an  IR  camera  and  light  from  an  on-chip  LED  in  front  of  the 
switch.  This  proved  to  be  very  useful,  allowing  the  beat  length  of 
130  pm  to  be  measured  directly  and  the  evolution  of  the  switching 


Conclusion:  We  have  presented  a  novel  method  for  optical  switch¬ 
ing  by  modifying  the  imaging  properties  of  modal  interference  sec¬ 
tions  through  current  injection  (supressed  modal  interference)  and 
have  applied  this  method  to  fabricate  1x2  switches  that  can  be 
used  in  conjunction  with  amplifiers  to  form  lossless,  scalable,  opti¬ 
cal  crossconnects.  The  switches  exhibited  a  very  low  excess  bar 
state  loss  of  0.5dB  and  on/off  ratios  of  16dB  (bar)  and  7.5dB 
(cross)  with  switching  currents  <  30mA. 
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Performance  of  behaviour  of  scattering 
mechanisms  in  time-scale  domain  against  random 
noise  for  centimetre  wave  radar 

T.  Le-Tien  and  D.T.  Nguyen 


Indexing  terms:  Radar,  Wavelet  transforms.  Scattering,  Random 
noise 

The  behaviour  of  scattering  mechanisms  in  the  time-scale  domain 
against  random  noise  is  presented  for  stepped  frequency  radar. 
The  down  range  profile  is  obtained  from  the  continuous  wavelet 
transform  (CWT)  of  the  backscattered  data.  Local  maxima 
detected  corresponding  to  different  noise  levels  confirm  the 
invariant  location  and  number  of  scattering  centres  in  the  range 
profile.  They  provide  crucial  information  for  automatic  target 
recognition. 

Introduction :  Radar  is  a  technique  for  detecting  and  measuring  the 
location  and  characteristics  of  objects  which  reflect  electromag¬ 
netic  energy  coming  from  an  antenna  system.  The  scattering  mech¬ 
anisms  which  describe  the  behaviour  of  objects  to  the  incident 
waves  can  be  separated  into  two  major  categories:  namely  disper¬ 
sive  and  nondispersive  scattering  mechanisms.  For  nondispersive 
scattering  such  as  comers,  edges,  or  specular  reflections,  the  local 
scattering  features  can  be  extracted  from  the  local  maxima  of  the 
target  response  in  the  time  domain.  The  local  maxima  are  located 
in  the  early  time  portion  of  the  backscattered  data  and  represent 
the  highest  energy  concentration.  Their  appearances  are  com¬ 
monly  sharp  in  the  time  domain  but  spread  out  in  the  frequency 
domain,  whereas  for  frequency  dispersive  scattering  such  as  from 
some  radar  absorbing  materials,  ducts  or  inlets  [1,  6],  the  corre¬ 
sponding  impulse  response  will  be  spread  in  the  time  domain.  The 
response  of  the  frequency  dispersive  scattering  gives  information 
about  the  global  features  in  the  frequency  domain  rather  than  the 
local  features  in  the  time  domain. 

The  inverse  scattering  problem  of  identifying  objects  in  the  far 
field  using  backscattered  data  has  been  of  interest  only  on  nondis¬ 
persive  scattering  mechanisms  which  reflect  most  energy  of  the 
incident  wave.  It  is  well-known  that  at  sufficiently  high  frequen¬ 
cies,  the  target  can  be  approximated  as  a  set  of  points  on  the 
object,  namely  isolated  nondispersive  scattering  centres  [2.  4].  The 
response  of  the  object  is  assumed  to  be  a  sum  of  individual 
responses  of  scattering  centres.  The  range  profile  for  colocated 
transmit/receive  antennae  represents  the  scattering  distribution  of 
the  object  along  the  radial  distance.  It  is  obtained  from  the  con¬ 
version  of  the  backscattered  data  (frequency  domain)  into  the 
range  domain  [5].  It  provides  information  about  the  strength,  loca¬ 
tion,  and  number  of  individual  scattering  centres.  The  crucial 
information  about  the  object  can  be  estimated  from  the  local 
maxima  of  the  response  in  the  range  profile.  Some  salient  features 
extracted  from  this  information  can  then  be  used  as  feature  vec¬ 
tors  for  automatic  target  recognition. 

In  this  Letter,  the  continuous  wavelet  transform  is  applied  to 
backscattered  data  with  various  noise  levels  in  the  frequency 


domain.  The  wavelet  transformed  signal  at  a  specific  scale  accord¬ 
ing  to  the  number  of  frequency  samples  of  the  data  is  converted 
into  the  range  domain  to  obtain  the  down  range  profile.  The  local 
maxima  corresponding  to  scattering  information  are  then  detected 
against  different  random  noise  levels.  They  demonstrate  how  the 
scattering  centres  are  sensitive  to  random  noise. 


Noisy  range  profile  based  on  the  CWT:  The  CWT  of  a  signal  fit) 
corresponding  to  F(co)  in  frequency  domain  is  defined  [3]  as 

CWT(a,b)  =  (Vo/Sir)  J  F(uW(auj)exp{jwb)ch  (1) 

where  <t>(co)  is  the  Fourier  transform  of  the  mother  wavelet  $(?),  a 
is  the  scale  and  b  is  the  translating  factor.  Eqn.  1  is  equivalent  to 
filtering  the  signal  fit)  with  the  bandpass  filter  <h(a(D)  whose  band¬ 
width  changes  according  to  the  scale  a. 

The  signaler)  can  be  recovered  from  its  CWT  as  follows: 

}{t)  =  (1  /C)  Jj  CWT\a.  b)<j>c,,t,(t){l/a~)dadb  (2) 

CO 

where 


C  =  J (|$(w)|2/|w|)du;  <  oo  (3) 

OC 

is  the  admissibility  condition  of  the  CWT. 

Let  the  incident  electric  field  with  the  wave  number  k  =  oVc, 
where  c  is  the  speed  of  light,  be 

Einc(r)  =  E0  exp  (4) 

The  backscattered  field  as  measured  by  a  linearly  polarised  receiv¬ 
ing  antenna  can  be  expressed  as  [2,  4] 

AT 

E{k,z)  ~  {E0(exp(jkz)/z)}  Y,  An(k)  exp{-j2kz.fn) 

n— 1 

(5) 


where  7 n  is  the  location  of  «th  scattering  centre,  f  is  the  unit  vec¬ 
tor  in  z  direction  and  A„(k)  is  a  weighting  coefficient  determined 
by  the  scattering  process.  By  normalising  electric  field  for  a  given 
polarisation  [4],  eqn.  5  can  be  rewritten  as 

N 

E(k)  =  Y  An{k)  exp(-j2krn)  (6) 

n= 1 

In  noisy  environments,  the  backscattered  data  can  be  expressed  as 


N 

En(k)  =  Y,  Mk)  exp(-;2fcrn)  +  N(k)  (7) 

71=1 

where  N(k)  is  Gaussian  noise  corresponding  to  the  /rth  sample. 

The  Morlet  Wavelet  in  the  normalised-frequency  domain  (wave 
number  domain)  is  given  by 

<p(k)  =  (y/wja)exp(-(k  -  k0f  / 4a)  (8) 


where  a  and  kQ  are  constants,  k  =  (Nc  is  the  normalised  frequency 
in  the  continuous  wavelet  transform. 

Then  the  CWT  of  eqn.  7  can  be  obtained  as  follows: 


CWT(a,b)  — 


An(k)  exp{-(ak-k0)2/4<x) 


x  exp(- jfc(2rn  -  b))dk  +  CWTn(a,  b)  (9) 


where  CWT„(a,  b )  is  the  CWT  of  N{k).  It  can  be  seen  from  eqn.  9 
that  the  amplitude  of  scattering  centres  vary  according  to  the 
weighting  coefficients,  the  value  of  a ,  and  CWT  fa,  b). 


Simulation  results:  The  backscattered  data  is  obtained  from  128 
stepped  frequency  samples  reflected  from  an  aircraft  model 
observed  using  a  centimetre  wave  radar.  The  starting  frequency 
and  frequency  step  are  chosen  to  be  10  GHz  and  20  MHz,  respec¬ 
tively.  The  Morlet  mother  wavelet  has  values  of  a  and  k0  equal  to 
0.5  and  2,  respectively.  The  signal- to-noise  ratio  (SNR)  is  chosen 
to  range  from  6  to  30dB  in  the  simulation.  This  is  taken  into 
account  for  every  stepped  frequency  sample  of  the  backscattered 
data. 

Figs.  1  and  2  show  the  frequency  responses  of  the  model  and 
their  CWT-based  range  profile  with  the  SNR  of  6  and  12dB, 
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Suppressed  Modal  Interference  Switches 
with  Integrated  Curved  Amplifiers  for 
Scaleable  Photonic  Crossconnects 

Gregory  A.  Fish,  Larry  A.  Coldren,  Fellow,  IEEE,  and  Stephen  P.  DenBaars 


Abstract —  A  novel,  compact  optical  switch  that  incorporates 
two  1x2  suppressed  modal  interference  (SMI)  switches  with  an 
integrated  curved  amplifier  is  demonstrated  as  a  basic  component 
for  scaleable,  lossless  photonic  crossconnects.  The  basic  operation 
of  the  switch  is  described  and  measured  on/off  ratios  of  22  dB  (16 
dB)  are  reported  for  the  cross  (bar)  state  with  switching  currents 
of  less  than  25  mA  per  section.  Lossless  switching  was  attained 
in  the  amplified  cross  state,  while  only  0.5  dB  of  excess  loss 
was  measured  in  the  unamplified  bar  state.  Cascading  of  these 
elements  should  lead  to  A  x  A  lossless,  optical  crossconnects 
with  crosstalk  levels  near  40  dB. 

Index  Terms — Integrated  optoeletronics,  optical  switches. 


I.  Introduction 

PHOTONIC  space  switches  are  very  attractive  components 
for  future  optical  networks.  Considerable  effort  and  a 
variety  of  methods  have  been  applied  to  meet  their  princi¬ 
ple  requirements,  namely,  low  insertion,  low  crosstalk,  wide 
optical  bandwidth,  polarization  insensitivity,  fast  switching 
speed,  scalability,  and  small  size.  Both  passive  and  active 
semiconductor  space  switches  in  InGaAsP-InP  have  achieved 
very  promising  results  and  have  nearly  fulfilled  all  of  the 
above  requirements  at  a  size  of  4  x  4  [l]-[4].  An  important 
remaining  issue  involves  maintaining  these  levels  of  perfor¬ 
mance  as  the  devices  are  scaled  to  larger  and  larger  arrays.  For 
example,  in  laser  amplifier  gate  arrays  the  insertion  loss  and 
crosstalk  scales  with  the  array  size.  Clearly,  large  arrays  will 
have  difficulty  maintaining  0-dB  insertion  loss  and  begin  to 
suffer  crosstalk  penalties.  While  in  passive  switches,  although 
the  insertion  loss  and  crosstalk  does  not  increase  as  severely 
per  additional  channel,  the  physical  size  becomes  such  that 
larger  arrays  are  not  as  tractable.  Combining  several  smaller 
arrays  into  larger  ones,  while  more  feasible,  is  not  as  attractive 
as  one  monolithic  device  from  a  packaging  perspective  and  has 
been  predicted  to  have  degraded  performance  in  simulations 

[5]. 

To  address  these  issues  we  report  a  novel  compact  optical 
switch  with  a  scaleable  architecture  that  still  provides  zero 
insertion  loss  and  high  contrast.  As  in  the  single-slip-structure 
(S  )  [6],  we  utilize  a  combination  of  switching  elements  and  a 
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traveling-wave  optical  amplifier  which  has  been  shown  to  be 
very  effective  in  providing  lossless,  low-crosstalk  switching 
while  maintaining  scaleability  [3], 

The  switching  elements  used  in  our  design  are  novel  and 
based  on  carrier  injection  induced  changes  in  a  modal  inter¬ 
ference  coupler,  called  suppressed  modal  interference  (SMI) 
switching,  the  details  of  which  are  previously  published  [8] 
and  summarized  below.  To  give  a  veiy  compact  basic  element 
for  a  crossconnect,  two  SMI  switching  elements  are  integrated 
with  a  small  radius  (250  pm)  curved  amplifier  to  form  a 
complete  optically  amplified  suppressed  modal  interference 
switch  (OASIS)  illustrated  in  Fig.  1(a).  As  shown,  the  input 
and  output  waveguides  are  orthogonal  for  increased  compact¬ 
ness  and  reduced  crosstalk  between  crossing  guides,  a  problem 
appearing  in  lower  angle  switch  architectures  [3],  [4].  By 
combining  16  OASIS  elements  as  shown  in  Fig.  1(b),  a  4  x  4 
crossconnect  that  occupies  only  a  few  square  millimeters  is 
achievable.  This  represents  a  significant  reduction  in  size 
over  existing  crossconnects,  and  allows  cascading  of  4  x  4 
or  bigger  blocks  on  chip  with  additional  amplifiers  to  achieve 
even  larger  arrays  while  maintaining  zero  insertion  loss. 

II  §WITCH  p)ESIGN  and  Fabrication 

To  gain  an  improvement  over  an  amplifier  gate  array  that 
simply  uses  splitters,  the  SMI  switches  used  in  the  OASIS  have 
very  little  excess  loss  in  the  bar  state,  yet  switch  enough  power 
into  the  amplified  path  that  the  cross  state  loss  can  be  zero. 
The  principle  by  which  the  SMI  switch  operates  is  injecting 
carriers  to  modify  the  lateral  index  profile  of  a  multimode 
interference  (MMI)  section  to  produce  a  different  output  at  the 
end  of  the  MMI  section.  In  detail,  a  3-^m-wide  input  ridge 
waveguide  is  coupled  off  center  to  a  6-^m-wide  multimode 
interference  section,  as  shown  in  Fig.  1(a).  The  interference 
section  has  two  modes  and  is  exactly  one  beat  length  long 
(150  fzm,  in  this  case)  such  that  the  input  field,  starting  on  the 
left-hand  side  of  the  MMI  section  is  inverted  at  the  output  and 
appears  on  the  right-hand  side  of  the  MMI  where  it  couples 
into  the  3-^m-wide  bar  state  output  waveguide.  To  switch 
to  the  cross  state,  current  is  injected  on  the  left  half  of  the 
MMI  section  modifying  the  lateral  index  profile  such  that 
the  higher  order  modes  are  cut  off,  suppressing  the  modal 
interference  and  the  light  is  guided  down  the  right  half  of  the 
MMI  section  and  into  the  curved  cross  state  output  waveguide. 
Some  important  aspects  in  the  design  of  the  switch  involve 
tailoring  the  lateral  index  profile  of  the  waveguides  to  have 
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(a) 


(b) 


Fig.  1.  (a)  Schematic  diagram  of  an  OASIS  element  showing  the  integration 

of  two  SMI  switches  with  a  small  radius  curved  amplifier,  (b)  By  cascading 
16  OASIS  elements  very  compact  4x4  «4  mm2)  crossconnects  can  be 
achieved. 


high  confinement  of  the  mode,  yet  still  be  weakly  guided 
enough  to  allow  for  switching  with  modest  injection  levels. 
We  chose  a  lateral  index  contrast  of  0.02  to  satisfy  this  criteria. 
It  is  also  important  to  confine  the  injected  carriers  to  the  left 
half  of  the  MMI  section  when  switching,  so  a  series  of  H4- 
implants  are  performed  to  prevent  conduction  from  the  right 
to  the  left  half  of  the  MMI  ridge. 

Combining  the  SMI  switches  with  a  250-/im  radius  curved 
amplifier  is  complicated  by  the  different  lateral  index  contrast 
required  by  the  two  elements.  Switches  must  have  small 
index  contrasts  to  be  effective  (less  than  0.03)  while  small 
radius  bends  require  large  index  contrasts  (greater  than  0.06) 
to  reduce  bending  loss.  To  solve  this  problem,  we  etched 
additional  cladding  material  from  the  outside  of  the  curve, 
as  shown  in  Fig.  1(a). 

Fabrication  of  the  OASIS  involves  only  two  MOCVD 
growth  steps  in  which  a  base  structure  with  an  offset  MQW 
(four  1%  compressive  InGaAsP  wells)  active  region  grown 
on  top  of  0.35  ^m,  InGaAsP  (Xg  =  1.35  /mi,  i.e.,  1.35 
Q)  waveguides.  The  active  region  is  removed  in  the  passive 
sections  of  the  device  and  the  p-cladding  and  contact  layers  are 
regrown  including  a  1.1-Q  stop  etch  layer  (located  0.14  fim 
above  the  waveguide)  to  define  the  lateral  index  contrast  (cho¬ 
sen  to  be  about  0.02)  of  the  passive  waveguides.  3-/zm-wide 
ridge  waveguides  are  dry  etched  using  CH4-H2-Ar  followed 
by  a  wet  H3PO4-HCI  clean  up  etch  to  the  stop  etch  layer, 
this  gives  a  well-controlled  uniform  lateral  waveguide  contrast 
required  for  these  type  of  devices  [7].  Further  etching  on  the 
outside  of  the  curve  section  follows  using  Br-HaPO^ILO. 


Om$ 


Fig.  2.  Schematic  diagram  of  the  testing  setup  illustrating  the  monitored 
LED  sources  and  lensed  fiber  used  to  test  the  switching  response  of  an  OASIS 
element  consisting  of  two  switches  (Si  and  S2)  and  a  curved  amplifier  (.4i ). 


Current  Applied  to  Each  Switch,  Igi&  IS2(mA) 
(a) 


(b) 


Fig.  3.  (a)  Cross-state  output  relative  to  that  at  zero  bias  as  the  current 

to  both  switches  is  increased  at  a  given  amplifier  current  level,  (b)  Cross 
and  bar  outputs  relative  to  straight  reference  guides  as  current  is  applied 
simultaneously  to  each  switch  and  the  amplifier  section. 


Finally,  the  devices  are  covered  with  SiNx,  metallized  and 
proton  implanted  to  isolate  the  various  sections  and  define  the 
unbiased  half  of  the  SMI  switches. 

in.  Device  Characteristics 

On-chip  MQW  LED’s  (A  =  1540  nm)  having  the  same  layer 
structure  as  the  curved  amplifier  were  used  as  sources  to  test 
the  switching  characteristics  of  the  OASIS.  While  monitoring 
the  back  side  power,  light  from  the  front  side  of  a  LED  was 
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sent  through  an  OASIS  element  and  collected  at  either  the  bar 
or  cross  state  output  with  a  lensed  (radius  9  yum)  single-mode 
optical  fiber  and  a  lock-in  amplifier,  as  illustrated  in  Fig.  2. 
To  evaluate  the  excess  loss  in  the  switch,  several  straight 
waveguides  without  switches  were  used  to  provide  a  reference 
level.  Fig.  3(a)  shows  the  performance  of  an  OASIS  at  various 
levels  of  amplifier  current  as  the  current  to  injected  into  each 
SMI  switches  is  increased.  On-off  ratios  of  nearly  23  dB  are 
achieved  with  less  than  25  mA  of  current  injected  into  each  of 
the  sections.  The  reversal  of  the  on-off  ratio  at  switch  currents 
higher  than  20  mA  is  attributed  to  mode  mismatch  between  the 
input  guide  and  the  modified  modal  interference  region  which 
was  observed  in  near  field  images  taken  of  the  end  of  the  SMI 
switch  [8].  In  Fig.  3(b)  the  bar-  and  cross-state  powers  are 
compared  to  reference  waveguides  showing  that  the  OASIS 
has  an  excess  loss  of  around  0.5  dB  in  the  bar  state  and  can 
achieve  an  “excess  gain”  of  nearly  2  dB  in  the  cross  state  with 
contrast  ratios  of  16  and  22  dB,  respectively.  When  multiple 
OASIS  elements  are  combined  in  a  TV  x  N  crossconnect  as 
shown  in  Fig.  1(a),  output  is  taken  only  from  the  cross  state, 
so  these  measured  on-off  ratios  could  be  combined  to  give 
expected  crosstalk  levels  near  40  dB  for  most  nonbroadcasting 
switch  configurations. 

IV.  Conclusion 

We  have  presented  the  results  of  a  novel  1  x  2  amplified 
switch  (OASIS)  where  a  contrast  of  over  22  dB  has  been 
achieved  without  loss  in  the  cross  state  and  only  0.5  dB  of 
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excess  bar  state  loss.  Orthogonal  inputs  and  outputs  should 
allow  very  compact,  easily  expandable  crossconnects  with 
reduced  crosstalk  to  be  created  from  multiple  OASIS  elements. 
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Compact,  4x4  InGaAsP-InP  Optical  Crossconnect 
with  a  Scaleable  Architecture 

Gregory  A.  Fish,  Beck  Mason,  Larry  A.  Coldren,  Fellow ,  IEEE ,  and  Steven  P.  DenBaars 


Abstract — An  InP-InGaAsP  4x4  optical  crossconnect  occu¬ 
pying  only  2x2  mm2  was  demonstrated  by  combining  16  1  x  2 
optically  amplified  suppressed  interference  switches  (OASIS)  in 
a  square  array  architecture.  On/off  ratios  of  33  ±  4  dB  were 
measured  for  a  majority  of  these  elements  and  fiber-to-fiber 
insertion  loss  of  3  dB  was  obtained  using  additional  on-chip 
amplification  stages. 

Index  Terms — Integrated  optoelectronics,  optical  switches. 


I.  Introduction 

NCREASINGLY,  optical  crossconnects  are  being  proposed 
as  the  basis  for  various  photonic  packet  switching  nodes  [1], 

[2] .  Many  of  these  strategies  involve  using  the  crossconnects  to 
provide  simultaneous  packet  routing  and  buffering  of  multiple 
input  channels  through  the  use  of  optical  fiber  delay  lines.  In 
addition  to  the  typical  optical  switching  requirements,  (e.g., 
low  crosstalk,  low  insertion  loss,  and  high  switching  speed) 
these  applications  demand  crossconnects  with  a  large  number 
of  channels,  as  several  buffering  channels  are  usually  needed  to 
provide  lossless  packet  routing  for  relatively  few  data  channels 
[1]. 

Crossconnects  based  upon  semiconductor  optical  amplifiers 
(SOA)  gates  are  well  suited  for  these  applications,  as  they  are 
compact  and  have  demonstrated  lossless,  low  crosstalk  (—50 
dB)  switching,  with  speeds  sufficient  for  cell  routing  (2  ns) 

[3] .  Unfortunately,  most  SOA  gate  arrays  to  date  use  tree 
architectures  where  the  loss  per  channel  increases  as  N 2  (N 
being  the  number  of  channels),  so  it  is  unlikely  these  strategies 
can  be  scaled  to  significantly  larger  sizes.  Combining  optical 
switches  with  SOA’s  has  been  shown  to  be  a  very  effective 
method  in  creating  a  more  scaleable  crossconnect  which  can 
have  lower  loss  and  crosstalk  than  a  simple  gate  array  [4]. 
Building  upon  this  concept,  we  demonstrate  a  very  compact 
4x4  (2x2  mm2)  optical  crossconnect  using  sixteen  1x2 
optically  amplified  suppressed  interference  switches  (OASIS) 
[5]  elements  as  shown  in  Fig.  1(a).  By  arranging  the  sixteen 
elements  in  the  square  array  architecture  shown  in  Fig.  1(b), 
the  average  insertion  loss  reduces  to  roughly  N  times  the 
excess  loss  of  the  optical  switches  [called  the  suppressed 
modal  interference  (SMI)  switch]  within  an  OASIS  element, 
which  can  be  made  small  (~0.5  dB)  [6]. 
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Fig.  1.  (a)  Schematic  diagram  of  a  1x2  OASIS  element  illustrating  the 
integration  of  a  curved  SOA  with  two  SMI  switches,  (b)  Sixteen  1  x  2  OASIS 
have  been  combined  to  form  a  very  compact  (2x2  mm2)  4x4  optical 
crossconnect  with  orthogonal  input  and  output  waveguides.  Input/output  stage 
SOA’s  were  integrated  to  help  overcome  fiber  coupling  losses. 

II.  Switch  Design  and  Fabrication 

The  details  of  the  design  of  the  SMI  switches  and  their 
integration  with  a  SOA  to  form  the  OASIS  have  been  pre¬ 
viously  reported  [5],  [6].  The  operation  of  the  SMI  switch 
can  be  summarized  as  follows.  The  input  waveguide  of  the 
SMI  switch  is  coupled  to  a  modal  interference  region  which  is 
designed  to  image  the  input  onto  the  straight  bar  state  output 
waveguide,  as  shown  in  Fig.  1(a).  By  injecting  current  into  the 
switching  electrode,  the  width  of  modal  interference  section  is 
effectively  reduced,  creating  a  singlemode  waveguide  which 
guides  the  light  into  the  curved  SOA  located  in  the  cross  state 
output  waveguide.  The  published  measurements  of  the  SMI 
switches  have  shown  they  can  provide  16  dB  (8  dB)  contrast 
ratios  for  the  bar  (cross)  state  with  an  excess  loss  of  0.5  dB 
for  the  bar  state  and  3-4  dB  (estimated)  for  the  cross  state. 
Adding  the  amplifier  to  the  cross  state  improved  its  contrast 
to  22  dB  with  a  net  gain  of  2  dB. 
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Fig.  2.  On-chip  measurements  of  the  on/off  ratios  of  a  number  of  OASIS 
dements  of  the  4  x  4  crossconnect.  The  current  to  the  SOA  and  two  SMI 
switches  is  swept  simultaneously  from  the  off  state  to  the  on  state.  The  off 
state  of  the  SOA  and  the  SMI  switches  was  at  0  and  8  mA,  respectively. 

Fabrication  of  the  OASIS  involves  only  two  MOCVD 
growth  steps  in  which  a  base  structure  with  a  MQW  (four 
1%  compressive  InGaAsP  wells)  active  region  grown  on  top 
of  0.35-^m-thick  InGaAsP  (Xff  =  1.35  ^m,  i.e.,  135-Q) 
waveguides.  The  active  region  is  removed  in  the  passive 
sections  of  the  device  and  the  p-cladding  and  contact  layers 
are  regrown  including  a  1.1  -Q  stop  etch  layer  (located  0.14 
fj,m  above  the  waveguide)  to  define  the  lateral  index  contrast 
(chosen  to  be  about  0.02)  of  the  passive  waveguides  and  SMI 
switches.  3-^m-wide  ridge  waveguides  are  dry  etched  using 
CH4/H2/AT  followed  by  a  wet  H3PO4/HCI  clean  up  etch  to 
the  stop  etch  layer.  This  gives  a  well  controlled  uniform  lateral 
waveguide  contrast  required  for  these  type  of  devices.  Further 
dry  etching  on  the  outside  of  the  curved  section  increases  the 
waveguide's  lateral  index  contrast  to  reduce  the  bending  losses 
that  occur  at  this  small  radius  (250  jum).  Finally,  the  devices 
are  covered  with  SiNx,  metallized  and  proton  implanted  to 
isolate  the  various  sections  and  define  the  unbiased  half  of  the 
SMI  switches. 

III.  Device  Characteristics 

Prior  to  testing,  a  quarter-wavelength-thick  SiOT  antire¬ 
flection  coating  was  applied  to  the  input  and  output  facets 
of  a  4  x  4  OASIS  crossconnect.  To  initially  test  the  on-chip 
performance  of  the  device,  the  input  optical  amplifier  of  a 
particular  channel  is  pumped  as  an  LED  and  its  emission 
is  collected  (using  a  9-yum  radius  lensed  optical  fiber  and 
a  lock-in  amplifier)  from  an  orthogonal  output  channel  after 
traversing  through  an  OASIS  element.  Fig.  2  shows  the  mea¬ 
sured  on/off  ratio  of  several  of  the  OASIS  elements  as  the 
current  to  the  SOA  and  the  two  SMI  switches  are  increased 
simultaneously  from  the  off  state  to  the  on  state.  The  off 
state  of  the  SMI  switches  in  this  array  was  at  8  mA,  due 
to  the  narrowing  of  the  waveguides  during  processing.  Future 
processing  adjustments  will  create  SMI  switches  with  zero  bias 
off  states,  as  originally  designed.  Measurements  on  eleven  of 
the  sixteen  functioning  OASIS  (some  were  damaged  during 


Fig.  3.  Fiber-to-fiber  transmission  of  I2O2  (input  channel  2  to  output  channel 
2)  illustrating  a  -3-dB  insertion  loss,  39-dB  on/off  ratio,  and  -38-dB 
crosstalk  contributions  to  the  other  channels.  The  input/output  stage  SOA’s 
are  biased  at  constant  60  mA  for  each  channel. 


Wavelength  (nm) 


Fig.  4.  Fiber  to  fiber  transmission  of  I4O]  illustrating  operation  over  a 
45 -nm  wavelength  span  with  an  insertion  loss  within  3  dB  of  the  minimum 
while  maintaining  at  least  a  39-dB  on/off  ratio  throughout. 

testing)  yielded  an  average  on/off  ratio  of  33  ±4  dB.  On 
chip  measurements  of  the  crosstalk  contributions  of  an  input 
channel  to  undesired  output  channels  were  very  similar  to  the 
off  state  of  the  given  OASIS  element  indicating  that  the  90° 
crossing  angle  of  input  and  output  waveguides  is  effective  at 
eliminating  crosstalk  at  these  junctions. 

Fiber-to-fiber  measurements  were  conducted  using  lensed 
fibers  (R  ~  9  yum)  and  an  external  cavity  tunable  laser  emitting 
at  1550  nm  with  a  optical  power  of  -20  dBm  as  the  source.  To 
help  overcome  fiber  coupling  losses  (5-6  dB/facet)  both  input 
and  output  amplifiers  of  the  measured  channel  were  biased 
at  a  constant  60  mA  throughout  the  measurements.  Fig.  3 
shows  the  fiber-to-fiber  transmission  of  I202  (input  channel 
2  to  output  channel  2)  indicating  an  insertion  loss  of  3  dB, 
a  39-dB  on/off  ratio,  and  —38-dB  crosstalk  contributions  to 
the  other  output  channels.  The  fiber-to-fiber  transmission  and 
on/off  ratios  were  measured  versus  wavelength  using  these 
same  conditions  (input/output  SOA  biased  at  60  mA)  for 
channel  I4Ox,  and  are  illustrated  in  Fig.  4.  The  insertion  loss 
remains  within  3  dB  of  the  nominal  minimum  over  a  45 -nm 
range  with  an  on/off  ratio  better  than  38  dB. 
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Fig.  5.  Estimated  on-chip  loss  of  channels  I1O2,  I2O2,  I3O2,  and  I4O2. 
The  high  value  of  loss  was  attributed  to  excess  roughness  in  the  curved 
SOA  waveguide.  The  small  increase  in  loss  as  additional  SMI  switches  are 
encountered  demonstrates  the  enhanced  scalability  of  this  design. 


All  the  above  measurements  were  carried  out  using  the 
best  case  polarization  (lowest  insertion  loss).  The  worst-case 
polarization  at  1540  nm  for  channel  I4O1  has  an  additional  14 
dB  of  insertion  loss,  but  maintains  a  35  dB  on  off  ratio.  The 
origin  of  this  polarization  sensitivity  is  due  (almost  entirely) 
to  the  compressively  strained  MQW  active  regions,  as  BPM 
simulations  of  the  SMI  switches  predict  negligible  polarization 
sensitivity.  By  incorporating  polarization  insensitive  MQW 
active  regions  [7],  [8],  this  problem  can  be  eliminated. 

Although  good  on/off  ratio  performance  was  obtained,  the 
on-chip  losses  (estimated  to  be  10-15  dB)  were  much  higher 
than  expected  based  on  past  measurements  of  single  1  x  2  OA¬ 
SIS  elements  [5].  This  loss  was  attributed  to  excess  roughness 
in  the  curved  waveguides  that  was  not  present  in  past  1x2 
devices.  Fig.  5  shows  the  estimated  on-chip  losses  of  various 
paths  through  the  array  without  the  use  of  the  input/output 
amplifier  stages.  The  loss  is  estimated  by  comparing  the  light 
emitted  from  the  back  of  the  input  SOA  (which  is  used  as 
a  source)  to  the  light  level  at  the  output  of  the  switch  and 
subtracting  the  gain  of  the  output  SOA.  Despite  the  large  total 
loss  of  around  10  dB,  the  loss  increases  by  only  0.4  dB  for  each 


SMI  switch  in  the  path,  confirming  the  enhanced  scalability  of 
this  architecture. 


IV.  Conclusion 

A  very  compact  4x4  optical  crossconnect  (2x2  mm2) 
with  a  more  scaleable  architecture  was  demonstrated.  On/off 
ratios  of  33  ±4  dB  were  measured  for  a  majority  of  the 
channels.  A  fiber-to-fiber  insertion  loss  of  3  dB  was  obtained 
with  the  use  of  additional  input  and  output  stage  SOA’s. 
Present  high  on-chip  losses  of  10  to  15  dB  should  be  reduced 
with  future  improvements  to  the  fabrication  process;  thus  0-dB 
insertion  loss  should  also  be  obtainable.  Despite  this  high 
total  loss,  paths  including  additional  switches  had  only  0.4 
dB  per  switch  more  loss,  confirming  the  scaleable  nature  of 
this  design. 
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Abstract —  Low-loss  single-mode  semiconductor  rib  optical 
waveguides  fabricated  in  GaAs-AlGaAs  epitaxial  layers  are 
removed  from  GaAs  substrates  and  bonded  to  transfer  substrates 
using  a  benzocyclobutene  organic  polymer.  Optical  quality  facets 
were  obtained  by  cleaving  through  the  transfer  substrate.  An 
average  propagation  loss  of  039  and  0.48  dB/cm  at  135  ftm 
wavelength  for  TE  and  TM  polarizations,  respectively,  were 
measured.  This  was  on  average  0.05  dB/cm  greater  than  control 
guides  fabricated  in  GaAs-AlGaAs  epilayers  on  GaAs  substrates 
with  air  as  the  top  cladding.  This  demonstrates  the  feasibility 
of  a  process  enabling  semiconductor  polymer  integration  and 
processing  both  sides  of  an  epitaxial  layer. 

Index  Terms — GaAs-AlGaAs  materials/devices,  optical  device 
fabrication,  optical  waveguides,  optoelectronic  devices,  organic 
polymer  materials/devices. 

I.  Introduction 

FOR  ALL  integrated  optoelectronic  devices  functionality 
is  provided  by  the  epitaxial  layer.  In  most  cases  a  sub¬ 
strate  is  not  desirable  but  has  to  be  present  for  epitaxial 
growth  and  handling  during  processing.  There  are  significant 
disadvantages  associated  with  the  presence  of  the  substrate. 
All  compound  semiconductor  substrates  have  high  relative 
dielectric  constants  resulting  in  excessive  capacitance.  Com¬ 
bined  with  the  high  sheet  resistance  at  microwave  frequen¬ 
cies,  substrates  hinder  the  performance  of  high-speed  devices 
such  as  modulators  [1]  and  photodetectors  [2].  Compound 
semiconductor  substrates  also  have  high  thermal  resistances 
resulting  in  inefficient  removal  of  heat  and  thermal  crosstalk. 
Therefore,  removing  the  substrate  eliminates  these  difficulties 
while  introducing  further  advantages.  After  removing  the 
substrate,  the  remaining  epilayers  can  be  bonded  to  transfer 
substrates  using  organic  polymers  resulting  in  semiconductor 
polymer  integration.  Hence,  low  propagation  loss,  low  cost, 
and  easier  fiber  pigtailing  of  polymer  waveguides  can  be 
combined  with  the  superior  electrooptic  properties  of  semi- 
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conductor  waveguides.  Furthermore  the  ability  to  pattern  and 
process  both  sides  of  an  epilayer  enables  new  and  novel  high 
performance  devices.  In  this  letter,  we  investigate  the  feasi¬ 
bility  of  removing  high  performance  optical  waveguides  from 
their  substrates  and  bonding  them  to  transfer  substrates  using 
organic  polymers.  We  also  characterize  these  waveguides  to 
see  how  they  perform  with  the  addition  of  new  processing 
steps  and  materials.  Such  waveguides  could  be  the  basic 
building  blocks  of  future  high-performance  optoelectronic 
devices. 

II.  Waveguide  Fabrication 

Cross-sectional  profiles  of  the  two  types  of  waveguides 
fabricated  in  this  letter  are  shown  in  Fig.  1.  Fig.  1(a)  shows  the 
substrate-removed  (SURE)  waveguides  fabricated  in  gallium 
arsenide/aluminum  gallium  arsenide  (GaAs-AlGaAs)  epilay¬ 
ers  and  embedded  in  an  organic  polymer  called  benzocy¬ 
clobutene  (BCB).  This  polymer  is  known  to  produce  easily 
fiber  pigtailed  high  quality  optical  waveguides  [3].  Fig.  1(b) 
shows  a  conventional  waveguide  fabricated  using  the  same 
material  structure.  These  types  are  used  as  control  waveguides. 

First,  an  appropriate  unintentionally  doped  epitaxial  layer 
was  designed  and  grown  by  molecular  beam  epitaxy  (MBE). 
The  epitaxial  layer  shown  in  Fig.  1  is  the  same  for  both  types 
of  waveguides  and  is  designed  for  optical  waveguiding  at 
1.55  /xm.  It  consists  of  a  2.05-^m-thick  Alo.3Gao.7As  bottom 
cladding  to  prevent  the  leakage  of  the  guided  mode  of  the 
conventional  waveguide  into  the  substrate,  a  0.73-yixm  GaAs 
core  layer,  and  a  0.8-/xm  Alo.3Gao.7As  top  cladding  layer. 
There  is  also  a  0.1-/xm-thick  AlAs  layer  between  the  bottom 
cladding  and  the  substrate.  This  layer  is  used  as  an  etch 
stop  layer  during  substrate  removal.  Two  separate  samples 
were  cleaved  from  this  wafer.  The  first  sample  was  used  for 
the  novel  SURE  waveguides  and  the  other  for  the  control 
waveguides. 

On  both  epitaxial  layers,  straight  4-^m-wide  waveguides 
were  patterned  with  photoresist  using  standard  lithography. 
The  single-mode  guides  were  then  wet  etched  5000  A  using  a 
10: 1  1M  citric  acid:H202  solution.  At  this  point  the  fabrication 
of  the  control  sample  was  complete  and  it  was  cleaved, 
mounted  and  measured. 

To  continue  with  the  fabrication  of  the  SURE  waveguides 
a  mechanical  grade  semi-insulating  GaAs  transfer  substrate 
was  solvent  cleaned,  spin  coated  first  with  AP-8000  adhesion 
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Fig.  1.  (a)  Schematic  cross-sectional  profile  of  the  SURE  waveguide,  (b) 
Conventional  rib  waveguide. 

promoter  and  then  with  9-jum  of  Cyclotene  3022-57  BCB  [4], 
The  rib  waveguides  fabricated  earlier  were  spin  coated  with  the 
same  adhesion  promoter  and  were  placed  epi  side  down  on  the 
BCB  coated  GaAs  transfer  substrate.  Following  this  a  full  cure 
of  BCB  at  250  °C  for  one  hour  was  performed  in  a  nitrogen 
purged  oven.  When  fully  cured  the  BCB  thin  film  acts  as  a 
glue  that  keeps  the  transfer  substrate  bonded  to  the  substrate 
containing  the  epilayer  and  the  fabricated  waveguides.  In  the 
curing  process  there  are  no  volatile  by-products.1  The  entire 
stack  was  mounted  on  a  glass  microscope  slide,  GaAs  transfer 
substrate  down,  using  wax.  This  exposed  the  growth  substrate 
only.  To  remove  the  growth  substrate  from  the  epi  layer,  a  wet 
spray  etch  was  performed.  This  consisted  of  a  30:1  mixture 
of  H202:NH40H  sprayed  as  a  fine  mist  onto  the  substrate. 
This  etch  stops  on  the  AlAs  etch  stop  layer  after  the  entire 
substrate  is  removed.  After  removing  the  substrate,  the  AlAs 
etch  stop  layer  was  removed  in  a  1:1  HF:H20  solution.  This 
left  the  thin  epi  layer  with  optical  waveguides  on  it  bonded  to 
the  GaAs  transfer  substrate  via  the  BCB  layer.  At  this  stage  the 
waveguides  are  as  shown  in  Fig.  1(a).  The  remaining  exposed 
epi  layer  was  very  smooth  and  uniform  in  spite  of  being  on  top 
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Fig.  2.  SEM  photograph  of  a  cleaved  facet  of  a  waveguide  in  a  substrate 
removed  GaAs-AlGaAs  epitaxial  layer  embedded  in  BCB. 

of  the  BCB.  No  cracks,  waviness  or  bubbles  were  observed  on 
epilayers  as  large  as  2.5  cm  by  2.5  cm.  The  planarity  of  the 
epi  was  good  enough  for  another  lithography.  We  were  able 
to  fabricate  patterns  as  small  as  2  /mi  and  as  long  as  2  cm 
on  such  epilayers  using  conventional  optical  lithography.  Part 
of  the  reason  for  this  uniformity  was  the  lack  of  volatile  by¬ 
products  during  the  curing  of  BCB  and  the  very  smooth  and 
planar  BCB  surface  obtained  after  spinning  and  curing. 

Optical  quality  facets  are  needed  to  characterize  the  SURE 
waveguides.  This  is  achieved  by  cleaving  the  GaAs  transfer 
substrate  together  with  the  epilayer  bonded  to  it.  This  requires 
the  alignment  of  the  crystal  planes  of  the  transfer  substrate 
and  the  substrate  containing  the  epilayer  during  bonding.  We 
observed  that  BCB  becomes  less  viscous  at  high  temperatures 
during  curing,  causing  the  samples  to  move  around  on  top  of 
the  transfer  substrate  coated  with  BCB.  We  took  advantage 
of  this  by  placing  the  samples  on  a  10°  angled  block  after 
a  rough  alignment  of  the  cleaved  edges  of  both  substrates. 
The  substrate  containing  the  epilayer  was  free  to  move  during 
the  cure  so  as  to  allow  the  cleaved  edge  of  the  substrate 
containing  the  epi  layer  to  align  to  the  cleaved  edge  of  the 
BCB  coated  transfer  substrate.  For  such  low  angles,  the  liquid 
tension  near  the  edge  prevented  the  sample  from  falling  off  of 
the  BCB  coated  transfer  substrate.  This  aligned  the  cleavage 
planes  of  the  epilayer  and  the  transfer  substrate  to  a  high 
enough  precision.  Finally,  optical  quality  facets  were  made 
by  cleaving.  This  was  done  by  nicking  the  comer  of  the 
transfer  substrate  with  a  sharp  blade,  then  by  applying  pressure 
to  allow  the  cleave  to  propagate  both  laterally  and  vertically 
through  the  BCB  and  the  epi  layer.  Despite  cleaving  through 
a  ~500- /un-thick  transfer  substrate,  9  fim  of  BCB  polymer, 
and  the  epi  layer,  good  quality  facets  shown  in  Fig.  2  were 
obtained.  It  should  be  pointed  out  that  this  is  not  the  only 
way  to  obtain  optical  quality  facets.  This  approach  is  used  to 
obtain  high-quality  facets  easily  in  order  to  be  able  to  carefully 
study  the  feasibility  of  the  substrate  removal  process.  Dry 
etching  is  another  very  viable  possibility.  This  would  allow 
any  other  substrate  with  good  adhesion  to  BCB  and  resistance 
to  substrate  etching  to  be  used.  For  cleaving,  one  can  use  other 
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Fig.  3.  10  log  {(1  +  V7\)/{1  -  \/K)}  as  a  function  of  different  sample 
lengths  for  control  and  SURE  waveguides  for  TE  and  TM  polarizations. 
K  is  the  ratio  of  the  minima  and  maxima  of  the  transmission  through  the 
waveguides  when  the  input  wavelength  is  changed  slightly. 

substrates  that  can  be  cleaved,  even  noncrystalline  substrates 
that  can  be  broken  along  a  straight  edges. 

III.  Experimental  Results 

Both  waveguide  samples  were  mounted  and  characterized 
on  an  optical  bench.  The  output  of  a  fiber  pigtailed  1.55- 
fim  distributed-feedback  (DFB)  laser  was  coupled  into  the 
waveguides  using  cleaved  fiber.  The  near-field  pattern  at  the 
output  was  imaged  on  a  vidicon  camera  using  a  microscope 
objective  and  a  collimating  lens.  The  image  was  displayed 
on  a  monitor.  We  observed  the  near-field  mode  patterns 
and  measured  the  propagation  loss  of  the  waveguides.  Both 
types  of  waveguides  were  single  mode.  The  Fabry-Perot 
resonance  technique  and  sequential  cleaving  were  used  to 
measure  the  propagation  loss  and  facet  reflectivity  of  the 
waveguides  [4],  [5].  The  output  power  was  detected  using 
a  Ge  photodetector  connected  to  an  optical  power  meter.  We 
varied  the  temperature  of  the  DFB  laser  through  an  external 
thermoelectric  temperature  controller.  This  results  in  slight 
changes  in  the  operating  wavelength  of  the  laser  which  in 
turn  modifies  the  transmission  through  the  Fabry-Perot  cavity 
formed  by  the  cleaved  facets  of  the  optical  waveguides.  Noting 
the  ratio  of  the  minima  and  maxima  of  this  transmission, 


K ,  and  plotting  101og{l  +  V~K)/(  1  -  VK)}  as  function  of 
different  sample  lengths,  one  can  determine  the  propagation 
loss  and  facet  reflectivity  of  the  waveguide.  Specifically, 
the  slope  of  this  plot  is  propagation  loss  in  dB/length  and 
the  intercept  is  101og(i?2),  where  R 2  is  the  facet  power 
reflectivity.  The  experimental  results  are  shown  in  Fig.  3. 

Twenty  waveguides  with  an  initial  length  of  9  mm,  cutback 
to  7  and  2  mm,  were  measured  on  the  control  sample.  A  linear 
fit  to  the  measured  data  produced  an  average  waveguide  loss 
of  0.34  and  0.45  dB/cm  for  TE  and  TM  polarized  inputs, 
respectively.  The  initial  length  of  the  SURE  waveguides  was 
8  mm,  which  were  then  cutback  to  6  and  2  mm.  The  average 
TE  and  TM  propagation  loss  for  ten  such  waveguides  was 
0.39  and  0.48  dB/cm,  respectively.  On  average,  the  loss  of  the 
SURE  waveguides  was  0.05  dB/cm  higher  than  the  control 
waveguides,  which  was  well  within  the  experimental  error 
of  the  measured  values.  This  shows  that  the  propagation 
loss  increase  due  to  this  novel  process  was  marginal.  The 
extrapolated  facet  reflectivity  of  the  control  waveguides  were 
0.36  and  0.25  for  TE  and  TM  polarizations,  respectively. 
Corresponding  values  for  SURE  waveguides  were  0.32  and 
0.26.  The  slight  variation  again  indicates  overall  good  quality 
facets.  No  measurable  variation  was  observed  between  the 
mode  shapes  of  the  two  types  of  waveguides. 

IV.  Conclusion 

We  have  successfully  demonstrated  the  feasibility  of  sub¬ 
strate  removal  and  bonding  the  remaining  epilayer  on  transfer 
substrates  using  BCB  in  the  fabrication  of  optoelectronic 
devices  in  the  GaAs-AlGaAs  material  system.  In  particular, 
optical  waveguides  fabricated  in  epilayers  removed  from  sub¬ 
strates  and  embedded  in  BCB  had  an  average  propagation  loss 
of  0.39  dB/cm  and  0.48  dB/cm  at  1.55  tim  for  TE  and  TM  po¬ 
larizations  respectively.  This  was  only  on  average  0.05  dB/cm 
greater  than  control  guides  fabricated  on  GaAs  substrates  with 
air  as  the  top  cladding.  The  substrate  removal  process  produced 
a  smooth  epi  layer  on  which  fine-line  lithography  was  possible. 
This  demonstrates  the  feasibility  of  processing  both  sides  of  an 
epilayer.  Optical  quality  cleaved  facets  were  also  produced  by 
aligning  the  crystal  axis  of  the  growth  and  transfer  substrates 
during  bonding.  Presently,  we  are  utilizing  this  novel  process 
to  fabricate  low-voltage  high-speed  electrooptic  modulators. 
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A  real-time  spatial-temporal  processor  based  on  cascaded  nonlinearities  converts  space-domain  images  to 
time-domain  waveforms  by  the  interaction  of  spectrally  decomposed  ultrashort  pulses  and  spatially  Fourier- 
j  ormed  images  earned  by  quasi-monochromatic  light  waves.  We  use  four-wave  mixing,  achieved  by  cas¬ 
caded  second-order  nonlinearities  with  type  II  noncollinear  phase  matching,  for  femtosecond-rate  processing. 
We  present  a  detailed  analysis  of  the  nonlinear  mixing  process  with  waves  containing  wide  temporal  and  an¬ 
gular  bandwidths.  The  wide  bandwidths  give  rise  to  phase-mismatch  terms  in  each  process  of  the  cascade 
We  define  a  complex  spatial -temporal  filter  to  characterize  the  effects  of  the  phase-mismatch  terms,  modeling 
the  deviations  from  the  ideal  system  response.  New  experimental  results  that  support  our  findings  are  pre¬ 
sented.  ©  2000  Optical  Society  of  America  [S0740-3224(00)01410-7] 
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1.  INTRODUCTION 

Methods  for  ultrafast  optical  waveform  synthesis  (also  re¬ 
ferred  to  as  pulse  shaping)  have  been  investigated  for  di¬ 
verse  applications  in  the  natural  sciences  and  engineer¬ 
ing.  Reviews  of  various  pulse-shaping  techniques  can  be 
found  in  Refs.  1  and  2.  Based  on  linear-system  theory, 
filtering  the  temporal  -frequency  components  of  an  ul¬ 
trashort  optical  pulse  results  in  high-resolution  waveform 
synthesis.  The  filtering  can  be  achieved  by  spatially  dis¬ 
persing  the  frequency  components  in  a  spectral  process¬ 
ing  device  (SPD),  a  free-space  optical  setup  consisting  of 
diffraction  gratings  and  lenses,  and  inserting  a  spatial 
mask  with  the  encoded  amplitude  and  phase  information. 
After  the  spectral  decomposition  wave3  (SDW)  has  been 
filtered  by  the  spatial  mask,  the  frequency  components 
are  recomposed  in  the  SPD  to  generate  the  synthesized 
temporal  waveform.  Early  pulse-shaping  experiments 
used  prefabricated  masks  to  filter  the  SDW,4  which  later 
were  replaced  by  active-filtering  devices  such  as  spatial 
light  modulators6  and  acousto-optic  modulators.6  This 
approach  may  not  yield  an  adequate  time  response  for 
adaptive  control  of  the  synthesized  waveform  needed  for 
some  applications,  as  it  is  limited  by  computation  time, 
signal  propagation  delay,  and  modulator  response  time. 
Additionally,  most  modulators  operate  with  either  phase 
or  amplitude  modulation,  requiring  a  complicated  cascade 
of  two  devices  for  complete  complex-amplitude  filtering.7 
An  alternative  approach  for  generating  the  spectral  filter 
uses  in  situ  optical  holographic  recording  of  the  interfer¬ 
ence  of  the  spatial  Fourier  transform  (FT)  of  a  spatial  im¬ 
age  and  a  reference  point  source.8’9  The  recorded  spatial- 
frequency  information  serves  as  a  spectral  filter  for  the 
SDW  of  an  incident  ultrashort  reference  pulse.  The  syn¬ 
thesized  waveform  is  correlated  to  the  spatial  image  used 
in  the  recording,  resulting  in  a  space-to-time  converted 
image.  This  approach  can  be  interpreted  as  a  four-wave 
mixing  process  between  two  waves  carrying  spatial  FT  in¬ 
formation  and  two  SDW '’s  with  the  temporal  FT  informa¬ 
tion,  resulting  in  the  exchange  of  information  between  the 
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spatial  and  temporal  channels.  The  holographic  record¬ 
ing  medium’s  characteristics  determine  the  performance 
of  the  information  exchange  between  the  four  interacting 
waves.  For  example,  using  holographic  film  will  require 
a  long  recording  and  processing  time.  Thick  holograms 
formed  in  bulk  photorefractive  crystals  yield  high  diffrac¬ 
tion  efficiency  but  also  require  a  long  recording  time  ow¬ 
ing  to  slow  electron  mobility  essential  in  buildup  of  a 
space-charge  field9  (from  tens  of  microseconds  to  several 
minutes).  Multiple-quantum-well  (MQW)  semiconductor 
photorefractives  can  perform  holographic  recording  with 
a  microsecond  response  time  yet  result  in  low  diffraction 
efficiency  owing  to  the  short  interaction  length.10’11 

For  compatibility  with  the  requirements  of  ultrafast 
applications  such  as  high-speed  optical  communication 
with  ultrashort  pulses  and  quantum  control  of  atomic  and 
molecular  vibrational  states,  the  synthesized  waveform 
must  be  updated,  or  modulated,  at  high  rates  with  both 
amplitude  and  phase  information.  Such  a  femtosecond- 
rate  response  time  can  be  provided  only  by  parametric 
processes  that  involve  bound-electron  nonlinearities. 
Our  space-to-time  conversion  scheme,  first  reported  in 
Ref.  12,  exchanges  the  information  from  a  spatial  image 
to  a  temporal  waveform  by  a  four- wave  mixing  process  in 
a  X{2)  nonlinear  crystal.  The  cascaded  second-order  non¬ 
linearity  (CSN)  arrangement  we  are  utilizing  consists  of  a 
frequency-sum  generation  process  followed  by  a 
frequency-difference  generation  process  that  satisfies  the 
type  II  noncollinear  phase-matching  condition.13  The 
nonlinear  wave  mixing  occurs  in  the  Fourier  domain  of 
the  temporal  and  spatial  channels  (see  Fig.  1).  The 
frequency-sum  process  mixes  the  SDW  of  the  input  ul¬ 
trashort  pulse  and  the  spatial  FT  of  the  spatial-image  in¬ 
formation  illuminated  with  a  quasi-monochromatic  plane 
wave.  The  resultant  intermediate  wave  is  determined  by 
the  product  of  the  complex  amplitudes  of  the  temporal 
and  spatial  FT  fields.  The  cascaded  frequency-difference 
process  mixes  the  intermediate  wave  with  the  spatial  FT 
of  a  point  source  illuminated  by  the  same  quasi- 
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Spatial  image  channel  ®  Polarization  parallel  to  crystal’s  extraordinary  axis 


Fig.  1.  Femtosecond-rate  space-to-time  conversion  setup  based  on  nonlinear  wave  mixing  with  noncollinear  type  II  cascaded  second- 
order  nonlinearities  in  the  Fourier-domain  plane  of  the  temporal  and  spatial  channels.  A  frequency-sum  process  between  waves  U j  and 
JJ2  gives  rise  to  the  wave  Umt .  A  frequency-difference  process  between  the  waves  Umt  and  Uz  generates  the  desired  output  wave  U4 . 


monochromatic  plane  wave.  The  second  spatial  wave 
contains  no  information,  in  either  the  space  or  the  time 
domain,  and  its  function  is  to  downconvert  the  carrier  fre¬ 
quency  of  the  intermediate  wave  generated  by  the  first 
nonlinear  process.  The  resultant  field  in  the  CSN  pro¬ 
cess  is  the  filtered  SDW,  which  is  recomposed  to  yield  the 
synthesized  output  temporal  waveform. 

In  this  paper  we  describe  and  analyze  in  depth  the 
space-to-time  conversion  process  employing  CSN,  achiev¬ 
ing  a  femtosecond  response  time  and  high  conversion  ef¬ 
ficiency.  In  contrast  to  our  initial  analysis  in  the  time 
domain,12  a  temporal-frequency-domain  analysis  is  pre¬ 
sented  in  Section  2,  in  which  we  develop  the  expression 
for  the  synthesized  temporal  waveform  under  the  as¬ 
sumption  of  an  ideal  CSN  process  with  perfect  phase 
matching.  In  Section  3  we  investigate  the  effects  of  the 
phase-mismatch  terms  in  the  cascaded  processes  and 
characterize  them  with  a  complex  spatial-temporal 
transfer  function.  The  derivation  of  the  phase-mismatch 
terms’  dependence  on  the  temporal-  and  spatial-frequency 
bandwidths  is  deferred  to  the  appendix.  Experimental 
results  verifying  the  findings  of  our  analysis  are  pre¬ 
sented  in  Section  4,  illustrating  the  ability  to  convert  both 
amplitude  and  phase  information  with  this  technique. 
We  conclude  with  a  summary  and  discussion  in  Section  5. 


2.  ANALYSIS  OF  THE  SPACE-TO-TIME 
CONVERTER 

The  nonlinear  interaction  between  the  SDW  of  the  input 
ultrashort  pulse  and  the  quasi-monochromatic  waves  at 
the  Fourier  plane  of  the  optical  setup  is  analyzed  in  this 
section  under  the  following  assumptions:  (i)  The  nonlin¬ 
ear  wave-mixing  process  is  weak,  such  that  the  amplitude 
of  the  generated  wave  is  proportional  to  the  product  of  the 
complex  amplitudes  of  the  mixed  waves,  (ii)  The  inter¬ 
acting  waves  are  phase  matched  for  all  temporal  frequen¬ 
cies  contained  in  the  bandwidth  of  the  short  pulse  and 
spatial  frequencies  of  the  spatial  image.  We  defer  the 
justification  and  discussion  on  the  validity  of  this  last  as¬ 
sumption  to  Section  3,  in  which  a  complex  filtering  re¬ 
sponse  function  is  introduced  to  characterize  the  effect  of 


phase  mismatch  on  the  conversion  process.  Since  the 
space-to-time  conversion  occurs  for  one-dimensional  spa¬ 
tial  signals,  the  y-axis  information  is  omitted.  In  the  fol¬ 
lowing,  we  first  analyze  the  input  temporal-  and  spatial- 
information  channels,  followed  by  an  analysis  of  the  CSN 
wave-mixing  process  giving  rise  to  the  synthesized  tem¬ 
poral  waveform. 


A.  Input  Optical  Channels 

An  ultrashort  optical  pulse,  with  a  temporal  envelope 
waveform  of  pit),  is  utilized  at  the  input  temporal  chan¬ 
nel  of  the  SPD.  The  pulse  is  propagating  in  free  space 
toward  a  diffraction  grating,  at  an  angle  6  relative  to  the 
grating  normal  (see  Fig.  1).  The  input  ultrashort  pulse  is 
characterized  in  the  propagating  pulse’s  coordinate  sys¬ 
tem  (x1,z1)  as 

I  Zi 

E pulse(*l.*i;  <)  =  Wl(Xi)p\  t  -  t0  -  — 


x  exp 


j 


—  Zi  “  Q)0t  I  , 


(1) 


where  )  defines  the  transversal  field  distribution  or 
the  spatial  mode  of  the  pulse,  c  is  the  speed  of  light  in 
vacuum,  (o0  is  the  center  optical  frequency,  and  t0  is  an 
arbitrary  time  reference.  The  pulse  is  propagating  in  the 
Zi  direction  at  a  group  velocity  and  a  phase  velocity  of  the 
speed  of  light.  We  assume  the  spatial  mode  size  of  the 
pulse  is  sufficiently  large  to  ignore  diffraction  effects  in 
the  region  of  interest  near  the  grating  for  the  representa¬ 
tion  of  Eq.  (1)  to  be  valid. 

Since  we  are  assuming  that  the  nonlinear  wave  mixing 
of  the  space-to-time  converter  operates  in  the  linear  re¬ 
gime,  the  signal  analysis  may  be  performed  in  the 
temporal-frequency  domain.  We  Fourier  transform  (FT) 
Eq.  (1)  to  perform  a  temporal-frequency  decomposition  of 
the  short  pulse,  yielding 


Epulse(*l>2i;  ®)  =  K>i(*i)exp|./-Z1jp(tO  -  to0) 

X  exp[J(to  -  w0)*o]>  (2) 
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where  the  tilde  overscript  denotes  a  FT,  e.g.,  p(cj) 
=  Jp(t)exp(jo)t)dt.  We  perform  a  rotation  of  the  coordi¬ 
nate  system  from  (xj ,  z{)  to  (x,  z)  for  compatibility  with 
the  coordinate  system  of  the  SPD,  yielding 


Eputee(x,z;  w)  =  wx(-z  sin  0  +  r  cos  0) 


X  exp 


j  “  ( —z  cos  6  -  x  sin  $) 


x  P(<o  -  <o0)exp[j(o)  -  ( o0)t0 ].  (3) 


To  find  the  incident  field  on  the  input  grating  of  the  SPD, 
the  field  of  Eq.  (3)  is  evaluated  at  z  =  0.  The  effect  of  the 
grating  diffraction  can  be  modeled  by  adding  the  grating 
momentum,  kg ,  in  the  x  direction  to  the  k  vector  that 
characterizes  the  propagation  direction  of  the  wave,  re¬ 
sulting  in 


-^'inpu t(**>  0. )) 


u)i(x  cos  0)exp 


-yj-sind-^j* 


Xp(w  -  O>0)exp[j(<o  -  u0)tQ].  (4) 


The  grating  momentum,  kg ,  and  the  incidence  angle,  6, 
are  chosen  such  that  the  center  frequency  <o0  will  diffract 
in  the  direction  of  the  optical  axis  of  the  system.  Setting 
wo  sin  61c  =  kg  and  substituting  the  grating’s  k  vector,  kg 
=  2n/A,  where  A  is  the  grating  period,  yields  sin  6 
—  Kq/A  =  a,  where  \q  is  the  center  wavelength.  To  fur¬ 
ther  simplify  the  notation,  we  define  a  new  spatial- 
aperture  function  such  that  w(x)  =  w^x  cos  ff).  Without 
loss  of  generality  we  also  shift  the  input  aperture  of  the 
beam  from  the  optical  axis  of  the  system  by  D/2.  This  is 
done  to  satisfy  the  noncollinear  beam-propagation  re¬ 
quirement  in  the  Fourier  plane  of  the  SPD  for  phase 
matching  in  the  CSN  process.  The  input  temporal- 
channel  field  to  the  optical  processor,  uM,  is  therefore 
characterized  by 


1  D\ 

ax 

\  ~  2/ exP 

-j(0)  -  too) - 

C 

X  p(a>  -  to0)exp[y(to  -  a>0)t0],  (5) 

We  may  inverse  FT  Eq.  (5)  to  generate  the  input  signal  in 
the  time  domain,  yielding 


utci(x;  t)  2f  \  ~  *°  +  ~jesp(-j«o<). 

(6) 

Equation  (6)  describes  the  short  pulse  scanning  across  the 
fixed  aperture  at  a  velocity  of  —c/a  in  the  x  direction. 
For  convenience  we  may  wish  to  set  the  time-delay  pa¬ 
rameter  such  that  the  traveling  pulse  is  at  the  center  of 
the  aperture  at  t  =  0.  This  is  achieved  by  setting  f0 
=  aD/2c . 

The  temporal-channel  field  of  Eq.  (5)  is  spatially  Fou¬ 
rier  transformed  by  a  lens  of  focal  length  /,  yielding  the 
SDW  U j  of  the  short  pulse, 


ui(x';  <u)  -  p(w  -  <o0)exp[j(co  -  o)0)t0] 

x  Lw(x  - 


ax 

~j(0)  -  OJq)  — 

c 


xx r 

X  exp(  -J2n—)dx, 

a;  j 


(7) 


where  we  defer  solving  this  Fourier  integral  to  the  non¬ 
linear  wave-mixing  analysis  of  Section  3.  The  two  spa¬ 
tial  channels,  one  containing  the  encoding  information 
and  the  second  a  point  source,  are  modeled  next. 

A  mask  containing  spatial-domain  information  is 
placed  m  the  input  plane  of  the  processor,  alongside  the 
diffraction  grating  used  for  the  input  temporal  channel. 
The  mask  is  shifted  from  the  optical  axis  by  -D/2  to  sat¬ 
isfy  the  requirement  of  noncollinear  beam  propagation  in 
the  Fourier  plane  of  the  SPD.  The  information  mask  of 
the  spatial  channel  is  illuminated  with  a  quasi- 
monochromatic  light  source  at  center  frequency  co1 .  The 
first  input  spatial  channel  of  the  processor,  uscl ,  can  be 
expressed  as 


“sd(*>  2  =  0;  t)  =  +  —  jexpi-jcojt),  (8) 

where  m(x)  is  the  spatial-information  mask.  The  first 
input  spatial-channel  field  is  spatially  Fourier  trans¬ 
formed  by  the  lens,  yielding  the  field  U2,  which  we  ex¬ 
press  in  the  temporal-frequency  domain  by  taking  the 
temporal  FT,  generating 


U2(xf;  o>)  —  S(o)  —  o^)  I  m\x  +  — 

J  -OS  \  2 

x  exp(-y2ir^;jdc,  (9) 

where  S(  )  is  the  Dirac  delta  function.  The  spatial  wave 
contains  no  temporal-frequency  bandwidth  owing  to  our 
illumination  with  a  quasi-monochromatic  fight  source. 

The  second  spatial  channel  consists  of  a  point  source  at 
the  input  plane  that  is  illuminated  by  the  same  quasi- 
monochromatic  fight  source  as  the  first  spatial  channel. 
The  point  source  is  shifted  from  the  optical  axis  by  —D/2, 
as  the  first  spatial  channel  has  been.  A  polarization- 
selective  beam  splitter  (see  PBS  in  Fig.  1)  is  used  for  effi¬ 
cient  superposition  of  the  two  spatial  channels  necessary 
for  the  wave-mixing  process.  The  second  input  spatial- 
channel  field  to  the  processor,  usc2 ,  can  be  expressed  as 

u&c2(x;  t)  -  S^x  +  —  jexpf-jfc)!*).  (10) 

The  second  input  spatial-channel  field  is  also  spatially 
Fourier  transformed  by  the  lens,  yielding  the  field  in  the 
Fourier  plane  U3,  expressed  in  the  temporal-frequency 
domain  as 
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f»  /  D\ 
U3(x';  (o)  =  8(a>  -  8\^x  +  — 


/  **'\ 

X  exp|  dx.  (11) 

We  next  examine  the  space-time  conversion  process  by 
nonlinear  wave  mixing  the  three  waves  defined  in  Eqs. 
(7),  (9),  and  (11). 

B.  Four- Wave  Mixing  by  the  Cascaded  Second-Order 
Nonlinearity  Process 

A  nonlinear  crystal  exhibiting  a  large  nonlinear  suscepti¬ 
bility  ^(2)  is  placed  at  the  Fourier  plane  of  the  SPD  (see 
Fig.  1).  The  two  optical  waves,  Ui  and  U 2 ,  from  the  in¬ 
put  temporal  channel  and  the  first  spatial  channel  inter¬ 
act  within  the  crystal,  giving  rise  to  an  intermediate  wave 
U-mi  in  a  frequency-sum  process.  The  type  II  noncol- 
linear  phase-matching  condition  is  satisfied  by  (i)  adjust¬ 
ing  the  polarization  directions  of  the  temporal  and  spatial 
channels  to  coincide  with  the  crystal’s  ordinary  and  ex¬ 
traordinary  directions,  respectively,  and  (ii)  setting  the 
interacting  input  waves  to  propagate  noncollinearly. 
The  second  condition  is  satisfied  by  the  spatial  separation 
of  the  input  channels,  resulting  in  a  propagation- 
direction-difference  angle  of  Dlf  between  U\  and  t/2  ■ 
The  complex  amplitude  of  the  generated  intermediate 
wave  is  proportional  to  the  nonlinear  polarization  arising 
from  the  two  fundamental  waves,  yielding 

Uint(x’;  to)  «  f  <0  -  Cl)U2(x';  H)dft 

J  -co 

=  X&W*';  «  "  «i )U2(x';  co,),  (12) 

where  xf«  is  the  effective  nonlinear-susceptibility  coeffi- 
cient  at  the  propagation  directions  of  the  interacting 
waves,  and  we  assume  it  to  be  independent  of  co  in  the 
temporal-frequency  bandwidth  of  interest.  The  convolu¬ 
tion  integral  describing  the  nonlinear  polarization  is 
trivial  to  solve  since  U2  has  no  temporal-frequency  band¬ 
width.  The  intermediate  wave  is  therefore  shifted  up  in 
frequency  by  cox ,  such  that  is  oscillating  at  a  center 
frequency  of  co0  +  a >i  and  is  polarized  in  the  extraordi¬ 
nary  direction  owing  to  the  type  II  interaction. 

Next,  we  consider  the  interaction  of  the  third  wave  U 3 
with  the  other  waves  in  the  nonlinear  crystal.  Since  the 
crystal  exhibits  type  II  phase  matching,  there  is  no  inter¬ 
action  between  the  wave  from  the  temporal  channel  U i 
and  the  wave  from  the  second  spatial  channel  £/3,  as  they 
are  both  polarized  in  the  ordinary-axis  direction  of  the 
crystal.  The  collinear  waves  from  the  two  spatial  chan¬ 
nels  produce  a  second-harmonic  wave,  but  it  is  of  no  in¬ 
terest  for  our  study  of  the  spatial-temporal  processor. 
The  signal  of  interest  is  generated  from  the  interaction  of 
the  wave  from  the  second  spatial  channel  Uz  and  the  in¬ 
termediate  wave  t/int  generated  in  the  frequency-sum 
process.  These  two  waves  are  orthogonally  polarized  and 
satisfy  the  noncollinear  phase-matching  condition  for  a 
frequency-difference  generation  process.  The  noncol¬ 
linear  phase-matching  condition  is  automatically  satisfied 
because  U2  and  U3  have  the  same  optical  frequency  and 


copropagate.  The  interaction  of  Uz  and  Uint  gives  rise  to 
a  fourth  wave  U4  at  the  output  of  the  nonlinear  crystal, 
which  is  our  signal  of  interest,  given  by 

U4(x';  CO)  «  x'eff  f  a  «  +  WK*')  0)dfl 

=  X^U^ix';  co  +  «a) 

=  <o)U2(x';  co^Ulix';  co,). 

(13) 

The  nonlinear  polarization  has  the  form  of  a  correlation 
integral  in  the  frequency-difference  process.  This  inte¬ 
gral  is  again  trivial  to  solve,  as  the  second  spatial  channel 
is  also  quasimonochromatic.  The  output  wave  is  equiva¬ 
lent  to  a  four-wave  mixing  process  achieved  by  cascaded 
second-order  nonlinearities.  The  resultant  wave,  C/4,  is 
the  SDW  of  the  synthesized  waveform  with  center  fre¬ 
quency  co0  ,  generated  by  filtering  the  input  SDW  U1  by  a 
spatially  modulated  wave.  It  is  copropagating  with  U i 
and  polarized  in  the  extraordinary  direction  (as  U^t  is  ex¬ 
traordinary  and  Uz  is  ordinary).  Polarization  optics  may 
be  used  to  separate  the  copropagating  waves  U1  and  U4 
(see  Fig.  1). 

The  generated  field  U4  is  spatially  Fourier  transformed 
by  a  lens  of  focal  length  f,  yielding  the  optical  field  on  the 
output  diffraction  grating,  given  by 

Utc2{x”;  «) 

=  J  U4(xr;  co)exp(  -j-^r)dr' 
a  p((o  -  <o0)exp[j(a>  -  co0)£0] 


X  dxrdx16x2dxz.  (14) 

Integrating  over  the  variables  x ' ,  x2 ,  and  xz  yields 
utc 2(r";  o>)  =  p(co  -  co0)exp [j(a>  -  <o0)*0] 

/  D\  \ 

mxi  ~  +  *1) 

ax  i 

X  exp  -j(oi  -  co0) - dx1.  (15) 

[  c  j 

Assuming  AX  <§  X0  for  typical  ultrashort  pulses,  we  can 
also  eliminate  the  wavelength  dependence  in  the  encod¬ 
ing  mask  by  replacing  X  with  X0 .  It  is  possible  to  conduct 
the  analysis  without  this  assumption,  resulting  in  weakly 
chirped  output  signals.14  We  next  perform  a  change  of 
variable  by  defining  a  new  integration  variable,  r,  such 
that  x "  +  x1  =  -cr/a.  The  resultant  field  on  the  grat¬ 
ing  is  given  by 
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utc2(x”\  (o)  =  p((o  -  <DO)exp[j((0  -  w0)t0] 

ax " 

X  exp  j{oj  -  oj0) - 

c 


X 


X  exp[  y(  o)  -  w0)r]dr.  (16) 

The  effect  of  diffraction  from  the  output  grating  of  the 
SPD  is  introduced  into  Eq.  (16),  yielding 


<w)  =  p(<x)  -  w0)exp[j(w  -  <o0)t0] 


x  expjy|(*>  -  <o0)-  +  kg 


X 


X  exp [j(o>  -  co0)r]dr.  (17) 

The  momentum  of  the  diffraction  grating,  kg  =  aco0/c, 
cancels  the  angular-dispersion  term  of  the  output  field, 
such  that  all  the  frequency  components  of  the  short  pulse 
copropagate  at  an  angle  0  relative  to  the  optical  axis.  To 
propagate  away  from  the  grating  plane,  we  reintroduce 
the  longitudinal  dependence  as  appears  in  Eq.  (3)  [using 
the  local  z "  coordinate  (see  Fig.  1)  and  neglecting  a  con¬ 
stant  phase  factor  from  the  shift  from  z  to  z"].  We  also 
replace  the  aperture  function  w(  )  by  the  original  spatial¬ 
mode  function  of  the  input  waveform,  wx(  )  [see  Eq.  (4)]. 
Additionally,  we  eliminate  the  D/2  shift  in  the  pupil  func¬ 
tion,  which  was  introduced  to  satisfy  the  phase-matching 
condition  in  the  Fourier  plane  of  the  processor,  yielding 


“outi (x\z",a>)  =  p{<x)  -  tt0)expL/(w  -  co0)t0] 

CO 

X  exp  j  —  ( x "  sin  0  -  z"  cos  6) 
c 


x  J  —x"  cos  0  —  z"  sin  0 

°  J  c  \ 

- t cos  0  \m\ - r 

a  I  Uo  a  / 

X  exp [j(<o  -  w0)r]dr.  (18) 

We  perform  a  rotation  of  coordinate  systems  from  the  op¬ 
tical  system’s  ( x ",  z")  to  that  of  the  output  pulse  (x2,z2) 
(see  Fig.  1)  and  take  the  inverse  temporal  FT  to  get  the 
output  in  space-time  coordinates, 


^'out(**'2  >  >  0  “  ®Xp  J  - Z 2  “  COq/ 


O>0 


X 


C  *\  /XlC  \ 

*  /  Uo a  j 


X  p\  t  -  t0  - - r)dr. 

c 


(19) 


Equation  (19)  describes  the  output  temporal  pulse  as  an 
integral  that  mixes  the  information  from  the  input 
temporal-pulse  structure,  the  spatially  encoded  mask, 
and  the  spatial-aperture  function.  This  type  of  associa¬ 
tion  between  the  spatial  and  temporal  characteristics  has 
been  shown  for  other  cases  of  ultrafast-waveform 
processing.4*15’16  When  the  beam  size  of  the  input  short- 
pulse  signal  is  very  narrow,  the  desired  functionality  will 
not  be  achieved.  Therefore  in  most  practical  cases  of  in¬ 
terest  we  operate  the  SPD  in  the  high-resolution  limit. 
In  this  case,  the  spatial-mode  extent  (or,  more  precisely, 
time-of-flight  duration)  wx(ct  cos  0/a)  is  much  longer  than 
the  duration  of  the  ultrashort  pulse  p(r).  Thus  the  inte¬ 
gral  can  be  approximated  by  simply  evaluating  the  input 
pupil  at  7  =  t  —  to  —  z2/cf  which  is  the  center  coordinate 
of  the  short  pulse  pit),  and  taking  it  out  of  the  integral, 
yielding 


^out(^2j^2j  t)  — 


exp  J 


O)0 

— 22  “  <i)ot 
c 


X  w 


C  *2 

—x2 - cos  6  t  —  t0 - 

a  c 


XyV  ~to~  7/’  (20) 

where  y(/)  is  defined  by  the  convolution  of  the  input  pulse 
and  a  time-scaled  encoding  mask,  such  that 


y(t)m 


I m  (  7  a  T)P^  ~  T^dr  =  771 


ill.) 

X0  <*  I 


(21) 


Equation  (20)  describes  an  ultrafast  waveform  propagat¬ 
ing  in  free  space  in  the  z2  direction.  The  temporal  char¬ 
acteristic  of  the  output  waveform,  y(f),  is  determined  by 
the  convolution  operation  of  Eq.  (21).  The  spatial  mode 
of  the  output  waveform  is  now  space-time  dependent,  as 
is  characteristic  in  filtering  experiments  of  spatially  dis¬ 
persed  temporal-frequency  components.17’18  This  depen¬ 
dence  travels  along  with  the  output  waveform  and  repre¬ 
sents  a  linear  skew  of  the  spatial  mode  relative  to  the 
time  position  of  the  output  waveform  y(t).  Thus  there 
may  be  variations  of  the  observed  waveform  at  different 
spatial  locations,  a  phenomenon  that  may  influence  point 
processes  such  as  detection,  spatial  filtering,  and  coupling 
of  the  output  signal  into  a  single-mode  fiber. 

We  next  analyze  the  conversion  process  in  the  nonlin¬ 
ear  crystal,  taking  into  account  the  effect  of  the  phase- 
mismatch  terms  of  the  CSN. 


3.  CASCADED  SPATIAL-TEMPORAL  WAVE 
MIXING 

The  real-time  spatial-temporal  wave  mixing  is  performed 
by  cascaded  femtosecond-rate  parametric  interactions  in 
a  crystal  exhibiting  a  strong  *<2)  coefficient.  The  CSN 
process  uses  a  combination  of  frequency  upconversions 
and  frequency  downconversions  and  has  to  satisfy  the 
conditions  of  energy  and  momentum  conservation  for  ef¬ 
fective  energy  transfer.  Owing  to  the  broad  temporal 
bandwidth  of  ultrashort  pulses  and  the  broad  angular 
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bandwidth  of  the  spatial-information  channel,  it  will  be 
practically  impossible  to  satisfy  the  phase-matching  re¬ 
quirement  necessary  to  achieve  high  conversion  effi¬ 
ciency.  Therefore  nonuniform  signal  conversion  across 
the  temporal  and  spatial  bandwidths  will  be  observed,  as 
phase  matching  cannot  be  supported  in  every  situation. 
In  this  section  we  focus  on  the  analysis  of  the  nonlinear 
conversion  process  in  the  crystal. 

To  simplify  the  analysis,  we  assume  that  the  generated 
waves  in  the  three-wave  mixing  processes  of  the  cascade 
are  operating  in  the  linear-conversion  regime  (i.e.,  weak 
interaction  leading  to  nondepleting  fundamental  waves). 
The  linear  interaction  allows  us  to  decompose  the  waves 
to  their  temporal-  and  spatial-frequency  domains,  to  treat 
each  frequency  component  individually,  and  to  integrate 
over  the  temporal  and  spatial  bandwidths  to  get  the  out¬ 
put  waveform.  By  decomposing  the  waves,  we  may  use 
the  monochromatic  plane-wave  solution  of  the  wave¬ 
mixing  process.  The  plane-wave  solution  will  be  valid  as 
long  as  the  nonlinear  crystal  is  within  the  confocal  pa¬ 
rameter  of  the  SDW.  Since  we  are  interested  in  high- 
resolution  processing,  which  has  a  short  confocal  param¬ 
eter,  the  allowable  crystal  length  will  be  limited,  thereby 
reducing  the  conversion  efficiency.  In  general,  there  will 
be  a  trade-off  between  the  resolution  and  the  conversion 
efficiency,  which  is  not  within  the  scope  of  this  study.  We 
express  the  spatially  and  temporally  decomposed  input 
fields  inside  the  nonlinear  crystal,  followed  by  a  solution 
to  the  coupled-wave  equations  under  the  nondepleting- 
pump  approximation.  We  calculate  the  phase-mismatch 
terms  in  the  appendix. 

A.  Decomposed  Temporal  and  Spatial  Fields  Inside  the 
Nonlinear  Birefringent  Crystal 

The  spectrally  decomposed  field  from  the  input  temporal 
channel  is  found  by  evaluating  the  Fourier  integral  of  Eq. 
(7),  yielding 


(o)  =  p(co  -  o)0 )w 


2  77C 


- —  +  (o)  -  (X)0) 

af 


2  77  D 

xrap|  -JTTfx‘ 


X  exp 


-j(<0  -  <oQ ) 


aD 

2 7 


(22) 


where  the  spatial  FT  of  the  input-pupil  function,  w(  ), 
has  mixed  space-  and  temporal-frequency  variables,  char¬ 
acterizing  the  spatial  dispersion,  and  we  use  t0 
=  aD/2c.  The  spatial  width  of  w  also  defines  the  confo¬ 
cal  parameter,  which  limits  our  crystal  length.  The  spa¬ 
tial  linear  phase  term  in  Eq.  (22)  determines  the  propa¬ 
gation  direction,  owing  to  the  spatial  shift  of  the  input 
pupil  by  D/2.  We  denote  the  angle  between  the  z  axis 
and  the  k  vector  of  the  propagating  wave  as  61 
=  -D/2  /,  using  the  paraxial  approximation.  The  SDW 
is  incident  upon  the  nonlinear  crystal  in  the  FT  plane  of 
the  lens.  At  the  crystal  interface,  located  at  z'  =  0  (see 
Fig.  1),  the  wave  is  refracted  into  the  crystal  in  accor¬ 
dance  to  Snell’s  law.  The  SDW  in  the  crystal,  when  ne¬ 


glecting  the  Fresnel  reflection  at  the  boundary  and  ac¬ 
counting  for  field  attenuation  in  a  higher-index  material, 
can  be  expressed  as 
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where  n {0m)  is  the  refractive  index  for  ordinary-polarized 
light  at  frequency  to,  and  the  hat  overscript  denotes  vari¬ 
ables  inside  the  crystal  (e.g.,  X  =  X/ti^).  Note  that  the 
linear  phase  terms  of  Eqs.  (22)  and  (23)  are  equal  in  mag¬ 
nitude,  owing  to  the  conservation  of  momentum  in  the  x ' 
direction  (i.e.,  kinematic  condition).  The  propagation  di¬ 
rection  inside  the  crystal  is  frequency  dependent,  owing 
to  the  dispersive  nature  of  the  refractive  index  of  the  crys¬ 
tal. 

The  mask  containing  the  spatial-domain  information 
generates  the  field  U2  in  the  FT  plane,  found  by  evaluat¬ 
ing  the  Fourier  integral  of  Eq.  (9).  For  the  purpose  of 
this  analysis  we  model  the  spatial-information  channel  by 
Six  -  £),  a  Dirac  delta  function  shifted  from  the  center  of 
the  information  mask  by  the  parameter  £.  The  synthe¬ 
sized  waveform  from  an  arbitrary  mask  mix)  is  calcu¬ 
lated  by  multiplying  the  delta-function  response  by  m(£) 
and  integrating  over  the  parameter  £  (yielding  the  convo¬ 
lution  integral).  By  use  of  the  delta-function  model  the 
first  input  spatial  channel  generates  an  extraordinary- 
polarized  plane  wave  with  a  single  spatial  frequency, 


277 


U2ix\  *>i)  =  exp 


D 

2 7 


(24) 


The  angle  between  the  z'  axis  and  the  propagation  direc¬ 
tion  of  this  wave  is  denoted  by  02i£)  =  (D/2  —  £)//.  The 
refracted  field  in  the  crystal,  U2 ,  preserves  the  momen¬ 
tum  in  the  x '  direction  and  attenuates  the  field  by  the  ex¬ 
traordinary  refractive  index.  Evaluating  the  propaga¬ 
tion  angle  and  k  vector  is  complicated  by  the  refractive- 
index  dependence  on  the  propagation  direction  and  is 
performed  in  the  appendix. 

The  second  spatial  channel,  consisting  of  the  delta 
function  at  x  =  -D/2,  generates  an  ordinary-polarized 
plane  wave  in  the  Fourier  plane  of  the  processor.  Evalu¬ 
ating  Eq.  (11)  and  accounting  for  the  refraction  into  the 
crystal  yields 


Uzix';  o>x)  = 
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where  n{QWl)  is  the  ordinary  refractive  index  at  frequency 
o)i .  The  angle  between  the  z'  axis  and  the  propagation 
direction  of  the  field  before  refraction  is  denoted  by  0Z 
=  D/2/.  The  nonlinear  interaction  of  the  input  fields 
Ul ,  XJ2 ,  and  Uz  is  analyzed  next. 
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B.  Solution  of  Coupled- Wave  Equations  Characterizing 
Cascaded  Processes 

The  general  solution  to  the  coupled-mode  equations  gov¬ 
erning  noncollinear  three-wave  processes  can  be  very 
complex.19  When  we  limit  the  analysis  to  weak  interac¬ 
tions  with  nondepleting  input  waves  and  slowly  varying 
functions  in  space  and  time  (which  holds  for  the  SDW),  an 
analytic  solution  to  the  process  is  possible.  Owing  to  the 
nondepleting-w ave  approximation,  only  two  differential 
equations  remain,  which  describe  the  evolution  of  the  in¬ 
termediate  wave,  U int ,  generated  by  a  frequency-sum 
process,  and  the  resultant  output  wave,  U4,  generated  by 
a  frequency-difference  process.  Geometrical  factors  are 
introduced  to  the  coupled  equations  owing  to  the  noncol¬ 
linear  arrangement  (where  we  ignore  the  effect  of  the 
walk-off  angle  between  the  k  vector  and  the  Poynting  vec¬ 
tor  in  the  propagation  of  extraordinary-polarized  fields  in 
an  anisotropic  crystal).  We  denote  the  angle  between  the 
z  axis  and  the  k  vector  of  the  intermediate  wave  as  6^t 
and  the  angle  between  the  z  axis  and  the  k  vector  of  the 
output  field  as  64  (both  defined  inside  the  crystal).  Both 
these  angles  vary  as  a  function  of  and  g  (the  dependence 
is  evaluated  in  the  appendix).  Neglecting  absorption  in 
the  crystal,  the  differential  equations  governing  wave 
mixing  of  monochromatic  plane  waves  are  given  by20 

dz 
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where  d  is  the  second-order  nonlinear  optical  coefficient. 
The  right-hand  side  of  the  equations  have  been  simplified 
by  substituting  the  center-frequency  value  for  the 
propagation-direction  angles,  6^  and  d4 ,  the  electric  per¬ 
mittivity  and  magnetic  permeability,  e  and  /x,  and  the  op¬ 
tical  carrier  in  the  term  preceding  the  nonlinear  polariza¬ 
tion.  The  phase-mismatch  terms  for  the  frequency- 
sum  and  frequency-difference  processes,  A k{1)(w,  g)  and 
Aki2)(w,g),  respectively,  are  given  by 

Aka)(w,  g)  =  ki(to)  +  k2(|)  -  k^w,  g),  (27a) 

Ak(2)(w,  g)  =  ks-  ^(w,  g)  +  k«(w,  g).  (27b) 

Figure  2  illustrates  a  graphical  representation  of  the 
phase-mismatch  terms  of  Eqs.  (27a)  and  (27b).  The 


equations  describing  the  phase  mismatch  in  the  wave¬ 
mixing  processes  are  nonlinear.  To  simplify  the  phase- 
mismatch  representation,  we  perform  a  small-signal 
analysis  about  co  ~  co0  and  £  =  0  and  linearize  the  rela¬ 
tionship.  The  details  of  the  linearization  process  and 
evaluation  of  the  two  phase-mismatch  terms  are  pre¬ 
sented  in  the  appendix. 

A  simple  analytic  solution  to  the  coupled  differential 
equations  can  be  obtained  by  arguing  that  since  the  gen¬ 
erated  wave  U4  is  weaker  than  the  input  waves,  we  can 
neglect  the  second  nonlinear  polarization  term  in  Eq. 
(26a)  (i.e.,  UiU2  >  U3U4).  This  assumption  decouples 
the  differential  equations,  implying  that  the  rate  at  which 
the  intermediate  wave  is  depleted  by  the  second  process 
of  the  cascade  is  much  smaller  than  the  rate  at  which  it  is 
being  generated.  Therefore  Eq.  (26a)  of  the  intermediate 
field  is  dependent  on  the  two  known  input  fields  and  can 
be  solved  directly.  This  solution  is  inserted  into  Eq.  (26b) 
describing  the  evolution  of  the  output  wave,  which  can 
also  be  solved  directly  now,  as  the  driving  waves  are  com¬ 
pletely  characterized.  The  solution  yields 


l  Polarization  parallel  to  crystal's  ordinary  axis 
©  Polarization  parallel  to  crystal's  extraordinary  axis 

Fig.  2.  Graphical  representation  of  the  phase  mismatch  in  non¬ 
collinear  mixing  in  an  anisotropic  crystal  for  (a)  the  upconversion 
process  and  (b)  the  downconversion  process. 
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U4(x\ £,z;  w) 
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where  the  complex  transfer  function,  HN-J  co,  £,  z),  char¬ 
acterizes  the  response  of  the  cascaded  wave-mixing  pro¬ 
cess  and  is  given  by 
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The  complex  transfer  function  of  Eq.  (29)  depends  on  the 
temporal-frequency  component  of  the  input  pulse,  on  the 
spatial  frequency  arising  from  the  amount  of  shift  f  of  the 
delta  function  in  the  encoding  mask,  and  on  the  length  of 
the  nonlinear  crystal.  Since  the  complex  transfer  func¬ 
tion  is  temporal-frequency  dependent,  the  output  wave¬ 
form  will  not  be  transform  limited.  However,  any  pre¬ 
dominant  quadratic-phase  term  can  be  eliminated  by 
longitudinal  adjustment  of  the  output  grating.  Reintro¬ 
ducing  the  encoding-mask  information,  the  SDW  of  the 
synthesized  waveform  at  the  edge  of  the  crystal  is  defined 
by 

U,(x\Lc;  «)«#!(*';  «)&£(*';  »i) 
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where  we  replaced  the  interaction  distance  z  by  the  thick¬ 
ness  of  the  crystal,  Lc.  To  find  the  output  temporal 
waveform,  Eq.  (30)  needs  to  be  spatially  Fourier  trans¬ 
formed  to  the  output  plane,  diffracted  from  the  output 
grating,  and  inverse  temporally  Fourier  transformed. 
Equation  (30)  cannot  be  solved  analytically  except  for  the 
most  trivial  cases,  thus  requiring  a  computation  program 
to  calculate  the  precise  space-to-time  converted  wave¬ 
form.  Such  a  program  could  also  compute  the  conversion 
efficiency,  which  will  be  information  dependent. 

For  practical  reasons  it  is  desirable  for  the  space-to- 
time  processor  to  operate  efficiently.  The  conversion  ef¬ 
ficiency  of  the  wave-mixing  process  with  CSN  can  be  cal¬ 
culated  with  Parseval’s  theorem,  as  the  total  power  in  the 
spectral  representation  is  equal  to  the  total  power  in  the 
time  domain.  Since  Eq.  (30)  defines  the  SDW  of  the  syn¬ 
thesized  waveform,  the  power  spectrum  is  equal  to  the 
spatial  intensity  distribution  at  the  output  face  of  the 
crystal.  The  power  spectrum,  or  output  intensity,  can  be 
evaluated  analytically  for  a  simple  mask  such  a s  mix) 
=  Six),  yielding 
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where  we  introduce  the  optical  powers  P1,  P2,  and  P3  of 
the  input  temporal  channel,  the  spatial-information  chan¬ 
nel,  and  the  spatial  reference  channel,  respectively,  and 


The  transfer  function  will  perform  spectral  filtering,  as  is 
evident  from  Eq.  (31).  The  conversion  efficiency,  defined 
as  the  ratio  of  the  powers  of  the  output  synthesized  wave¬ 
form  to  the  input  temporal  channel,  depends  linearly  on 
the  power  level  of  each  spatial  pump  channel.  Our  find¬ 
ings  were  experimentally  validated  and  are  presented  in 
the  following  section. 


4.  EXPERIMENTAL  RESULTS  OF  SPACE-TO- 
TIME  CONVERSIONS 

The  operation  of  the  spatial-temporal  processor  of  Fig.  1 
was  verified  by  synthesizing  various  ultrafast  temporal 
waveforms.  In  the  experiments  we  used  a  commercial  la¬ 
ser  system  consisting  of  a  Ti: sapphire  ultrashort  pulse  os¬ 
cillator  and  a  regenerative  amplifier.  The  system  gener¬ 
ates  ultrashort  laser  pulses  of  100-fs  duration  at  a  center 
wavelength  of  800  nm  with  an  energy  level  of  1  mJ  per 
pulse.  For  the  intense  quasi-monochromatic  light  source 
required  by  the  spatial  channels  we  used  90%  of  the  emit¬ 
ted  output-pulse  power  and  stretched  the  pulse  to  a 
several-picosecond  duration  by  a  grating  pair.  The 
stretched  pulse  was  split  into  two  beams  for  implement¬ 
ing  the  two  quasi-monochromatic  spatial  channels. 
Since  the  CSN  process  occurs  with  a  femtosecond-scale 
time  response,  the  slow  temporal-frequency  variation  in 
the  stretched  pulses  does  not  affect  the  wave-mixing  pro¬ 
cess,  as  long  as  the  temporal  frequencies  of  the  two  spa¬ 
tial  channels  are  instantaneously  equal.  The  remaining 
10%  of  the  short-pulse  laser  output  power  was  used  as  the 
reference  ultrashort  pulse  in  the  input  temporal  channel. 
The  SDW  U i  is  generated  by  a  600-lines/mm  blazed  grat¬ 
ing  that  provides  an  angular-dispersion  parameter  of  a 
=  0.48  and  a  lens  of  f  =  375  mm  focal  length.  The 
spatial-temporal  wave  mixing  by  the  xi2)  media  was  per¬ 
formed  in  a  2-mm-thick  type  II  /3-barium  borate  crystal. 
Several  experiments  were  conducted  to  illustrate  this 
real-time  processing  technique,  demonstrating  its  ability 
to  control  amplitude  and  phase  in  the  output  temporal 
waveform  by  a  complex-amplitude  spatial-information 
channel. 

All  the  synthesized  waveforms  were  observed  with  a 
real-time  pulse-imaging  setup.21  The  pulse-imaging 
technique  generates  a  spatial  signal  at  the  output  plane 
that  is  proportional  to  the  temporal  convolution  of  the 
synthesized  waveform  and  a  reference  ultrashort  pulse. 
The  reference-pulse  source  used  in  the  pulse  imager  was 
the  residual  input  pulse  of  the  spatial- temporal  proces¬ 
sor,  after  separating  it  from  the  synthesized  output  wave- 
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form  with  a  polarizing  beam  splitter.  We  viewed  the  out¬ 
put  signal’s  intensity  with  a  charge-coupled  device  and 
extracted  the  temporal  information  from  the  image. 


A.  Conversion  Efficiency  and  Spectral  Response 
For  initial  characterization  of  the  spatial— temporal  wave- 
mixing  process  by  CSN,  the  two  monochromatic  waves 
were  focused  by  cylindrical  lenses  to  form  line  sources  at 
the  input  spatial  channels.  The  resulting  spatial  plane 
waves  Z72  and  Uz  interact  with  the  SDW,  Ul9  of  an  input 
transform-limited  short  pulse.  Since  the  plane  waves 
contain  no  information,  in  either  the  space  or  the  time  do¬ 
main,  their  function  is  to  upconvert,  then  downconvert, 
the  carrier  frequency  of  the  SDW,  generating  by  the  CSN 
process  the  new  SDW  U4 .  The  downconverted  SDW  U4 
was  generated  only  in  the  presence  of  the  waves  from  the 
two  spatial  channels,  verifying  that  the  output  SDW  is 
produced  by  a  CSN  process. 

The  dependence  of  the  synthesized  waveform’s  output 
power  to  the  optical  power  in  each  of  the  spatial  channels 
was  characterized  by  placing  variable  neutral-density  fil¬ 
ters  in  the  spatial  channels  (see  Fig.  3).  The  linear  de¬ 
pendence  of  the  output  power  to  each  spatial  channel  is 
verified  by  the  excellent  fit  to  the  10-dB/dec  line.  When 
both  spatial  channels’  powers  are  varied  simultaneously, 
the  output  power  variation  fits  the  20-dB/dec  line.  The 
linear  dependence  confirms  our  assumption  of  the  nonde¬ 
pleting  pump  approximation  in  Section  3.  However,  we 
observed  that  the  dependence  is  sublinear  when  the  spa¬ 
tial  channels  powers  were  weak.  This  effect  is  more  pro¬ 
nounced  when  only  one  of  the  spatial  channels  is  attenu¬ 
ated.  We  believe  this  is  due  to  depletion  of  the 
attenuated  pump  wave,  as  our  analysis  of  Section  3  did 
not  take  into  account  strong  interaction  that  may  lead  to 
such  fundamental-wave  depletion.  The  maximum  con¬ 
version  efficiency  of  the  space-to-time  processor,  mea¬ 
sured  as  the  ratio  of  the  input-pulse  optical  power  to  the 
output-pulse  optical  power,  was  16%,  illustrating  the  ad- 


Pump  power  of  varied  spatial  channel  [mW] 

Fig.  3.  Output  power  measurements  as  a  function  of  spatial 
pump-beam  powers.  Varying  one  spatial  channel  while  keeping 
the  second  constant  illustrates  the  linear  dependence  of  the  out¬ 
put  temporal  power  on  each  spatial  channel.  Varying  the  power 
of  the  spatial  channels  simultaneously  displays  a  quadratic  de¬ 
pendence,  as  expected. 


Fig.  4.  (a)  Measured  power  spectrum  of  the  input  pulse  (solid 
curve)  and  the  generated  output  pulse  (dashed  curve)  exhibiting 
spectral  filtering  owing  to  a  spectrally  dependent  phase  mis¬ 
match  in  the  processor.  Also  shown  is  the  theoretical  output 
power  spectrum  (dot-dash  curve)  calculated  by  applying  the  spec¬ 
tral  filter  to  the  measured  input  power  spectrum,  (b)  Pulse  im¬ 
ages  of  the  input  pulse  (solid  curve)  and  the  generated  output 
pulse  (dashed  curve),  demonstrating  the  increased  duration  of 
the  output  pulse  owing  to  spectral  filtering. 


vantage  of  the  CSN  approach  as  opposed  to  conventional 
a  }  nonlinearity  for  four-wave  mixing.22 

The  power  spectrum  of  the  output  pulse  was  compared 
with  that  of  the  input  pulse  (see  Fig.  4a).  The  observed 
spectral  narrowing  is  due  to  an  increasing  phase  mis¬ 
match  as  the  frequencies  deviate  from  the  center  fre¬ 
quency.  A  theoretical  curve  of  the  expected  output-pulse 
power  spectrum  [based  on  Eq.  (31)]  shows  a  close  corre¬ 
spondence  to  the  actual  observed  spectrum.  The  theoret¬ 
ical  curve  was  generated  by  evaluating  the  phase  mis¬ 
match  for  the  two  processes  in  the  cascade  with  published 
/3-barium  borate  parameters23  for  type  II  noncollinear  in¬ 
teraction  with  a  4°  internal  angle  between  the  fundamen¬ 
tal  waves  (assuming  both  processes  are  phase  matched  at 
w  ~  311(1  f  =  0).  We  adjusted  the  center  frequency  of 

the  theoretical  curve  to  match  the  measured  data.  Ow¬ 
ing  to  the  reduction  of  the  available  bandwidth  from  spec¬ 
tral  filtering,  we  observed  an  accompanying  increase  in 
the  temporal  duration  of  the  output  pulse  (see  Fig.  4b). 
The  measured  full  width  at  half-maximum  of  the  detected 
image  in  the  time-to-space  converter  increased  from  182 
to  288  fs. 
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B.  Packet  of  Ultrafast  Pulses 

Novel  uses  for  bursts  of  ultrashort  pulses,  such  as 
terahertz-rate  excitation  and  optical  communication  with 
ultrashort  data  packets,  have  increased  the  interest  in  de¬ 
veloping  techniques  for  generation  of  such  ultrashort- 
pulse  sequences.  With  the  femtosecond-rate  space-to- 
time  converter,  such  a  pulse  sequence  can  be  generated 
by  a  one-dimensional  mask  containing  an  array  of  point 
sources.  The  mask  information  can  be  expressed  by 

Nf 2 

m{x)  =  2  AnS(x  -  rcAx),  (33) 

n  -  -N/2 

where  A*  is  the  fixed  pitch  between  adjacent  point 
sources,  and  An  is  a  scalar  weight  term  that  may  be  a  bi¬ 
nary  bit  (value  of  1  or  0),  a  real  number,  or  a  complex 
number.  By  use  of  this  information  mask  in  Eq.  (21),  the 
temporal  output  signal  takes  the  form 

N/2  ,  v 

I  CO}  ct  \ 

y(t)  =  2  Ans U - nAx  ®p(f) 

n^-N/2  \  <*>0  c  i 

N/2  .  j  v 

=  2  A^plt - -—n^  ) .  (34) 

In  the  space-to-time  mapping,  the  spatial  pitch  Ax  in  the 
mask  has  been  converted  to  a  temporal  separation  of  Af 
—  (o1a^x/(o0c. 

Various  masks  containing  narrow  slits  have  been  used 
in  the  space-to-time  setup  to  generate  ultrafast  pulse 
packets.  Previously,  we  reported  results  using  a  mask 
with  a  0.8-mm  pitch.12  A  finer  mask  with  a  0.4-mm  pitch 
was  used  to  generate  a  denser  ultrafast  pulse  packet  (see 
Fig.  5).  A  cylindrical  lenslet  array  was  used  to  focus  the 
spatial  beam  into  the  mask  for  increased  light  through¬ 
put.  The  spatial  beam  mode  shape  resulted  in  nonuni¬ 
form  power  on  each  spatial  slit.  This  power  distribution 
has  been  transferred  to  the  ultrafast  pulse  packet.  The 
space-to-time  mapping  coefficient  transforms  the  0.4-mm 
spatial  pitch  to  a  time-domain  separation  of  A t 
=  0.64  ps  (with  w1  =  o)  0  and  a  =  0.48).  The 


Fig.  5.  1.56-THz-rate  ultrashort-pulse  packet  generated  by  a 
spatial-information  mask  consisting  of  a  sequence  of  point 
sources  separated  by  0.4  mm. 
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measured  temporal  separation  was  0.645  ps  with  a  reso¬ 
lution  of  33  fs,  resulting  in  a  1.56-THz-rate  pulse  packet. 

C.  Square  Pulse 

Generation  of  synthesized  square  pulses  by  spectral  fil¬ 
tering  is  not  a  simple  task.  The  mask  should  contain  am¬ 
plitude  information  over  a  wide  dynamic  range  as  well  as 
phase  information  for  implementing  negative  values. 
Space-to-time  mapping  techniques  simplify  the  task  of 
complex  mask  preparation,  while  the  requirement  for  a 
wide  dynamic  range  in  the  recording  medium  remains. 
In  photorefractives,  where  the  recorded  space-charge  field 
is  dependent  on  the  modulation  function,  the  recording 
process  exhibits  a  nonlinear  dependence  on  the  recording 
beams.  While  this  enables  nonlinear  processing,24  for 
generation  of  high-fidelity  square  pulses  a  weak  informa¬ 
tion  beam  must  be  used,  giving  rise  to  shallow  modulation 
depths  and  low  diffraction  efficiency. 

In  our  wave-mixing  approach  with  CSN  the  amplitude 
of  the  synthesized  wave  is  dependent  on  the  amplitudes  of 
the  three  input  waves  (as  long  as  wave  depletion  and 
phase  mismatch  are  negligible).  This  linear  relationship 
gives  rise  to  high-fidelity  conversion  from  space  to  time. 
In  the  case  of  a  square  pulse  the  synthesized  signal  may 
be  expressed  as 

y(f)  =  rectjyj  ®P(<)>  (35) 

where  T  =  a)laX/o)0c  is  the  duration  of  the  rect  function 
and  X  is  the  width  of  the  slit. 

An  adjustable  slit  was  placed  in  the  spatial-information 
channel  to  experiment  with  several  synthesized  square- 
pulse  durations.  The  generated  waveforms  (see  Fig.  6) 
were  modified  in  real  time  by  changing  the  slit  width,  and 
the  square  pulses  were  immediately  observed  with  the 
pulse  imager  on  a  monitor.  At  wide  slit  widths,  nonuni- 
formity  across  the  top  of  the  square  pulse  has  been  ob¬ 
served.  We  have  verified  that  the  reason  is  due  to  non¬ 
uniformity  in  the  illumination  of  the  spatial  channel.  An 
image  of  the  spatial-information  channel  in  the  space-to- 
time  processor  matched  well  with  the  observed  image  of 
the  synthesized  waveform  in  the  time-to-space  converter. 
There  are  two  limitations  to  the  maximum  slit  width  that 
can  be  used:  the  resolution  of  the  SPD  [see  Eq.  (19)],  and 
the  optical  power  from  the  tight  focus  of  the  spatial  chan¬ 
nel  in  the  Fourier  plane  may  exceed  the  damage  thresh¬ 
old  of  the  nonlinear  crystal. 

D.  Spherical  Wave  Front 

To  demonstrate  the  ability  to  encode  phase  information,  a 
point  source,  generated  by  focusing  the  spatial  beam  with 
a  cylindrical  lens,  was  used  as  the  spatial-information 
channel.  Translating  the  lens  longitudinally  away  from 
the  input  plane  formed  a  spherical  wave  front  with  vari¬ 
able  quadratic  phase  on  the  input  plane  of  the  spatial- 
information  channel.  As  the  translation  of  the  line 
source  from  the  input  plane  is  increased  (in  either  the 
positive  or  the  negative  direction),  the  output  SDW  ac¬ 
quires  a  larger  positive  or  negative  quadratic  phase  and 
after  recomposing  on  the  output  grating,  emerges  as  a 
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chirped  pulse  (either  a  positive  or  negative  chirp).  The 
measured  synthesized  chirped  pulses  exhibited,  as  ex¬ 
pected,  broader  pulses  along  with  a  reduction  of  the  peak 
intensity.12 

The  effect  of  a  longitudinal  translation  of  the  point 
source  away  from  the  input  plane  of  the  processor  is  ex¬ 
pressible  as  a  convolution  with  a  quadratic-phase 
function25  (free  space  propagation),  resulting  in  the  field 
distribution  in  the  spatial-information  channel, 


mix)  = 


exp (jk£) 
AiC 


1  * 

“p|  —  57/- 


(36) 


where  £  is  the  displacement  of  the  focus  plane  of  the  cy¬ 
lindrical  lens  from  the  input  plane  of  the  processor.  The 
phase  information  described  in  Eq.  (36)  gives  rise  to  the 
synthesized  waveform 


exp  (jk£) 

y{t)  =  ~T\  7  expl 


(o0ct\ 

J0)1  \  eo1aj 

c  i7~ 


21 


®p(t)9  (37) 


which  describes  a  chirp  function  (quadratic  temporal- 
frequency  variation)  imposed  on  the  input  pulse.  A 
Fourier-domain  representation  of  the  output  synthesized 
signal  is  easier  to  interpret  in  this  case,  yielding 


Translation  of  line  source  away  from  input  plane  [mm] 

Fig.  7.  Linear  correspondence  between  translation  of  the  point 
source  away  from  the  input  plane  of  space-to-time  converter  and 
translation  of  the  image-formation  plane  in  the  pulse  imager. 
The  solid  line  corresponds  to  a  theoretical  curve  of  slope  1/2. 


cation  of  the  plane  at  which  the  wave  focuses  changes  (ei¬ 
ther  farther  or  closer).  The  spatial  output  signal  of  the 
pulse  imager  can  be  expressed  as21 


b{x")  =  p 


2a  \  I  2a  \ 

Tx’Vy[~x") 


y(o>)  a  exp 


-J 


<o  —  <oq  \  2  o>1£a2 


oj0 


p(a>  -  co0).  (38) 


From  Eq.  (38)  it  is  clear  that  the  amount  of  chirp  is  lin¬ 
early  dependent  on  the  displacement  £. 

To  measure  the  amount  of  chirp  that  is  present  in  the 
synthesized  waveform,  we  utilize  a  unique  property  of  the 
pulse  imager.  Since  the  pulse  imager  generates  the 
complex-amplitude  information  of  the  temporal  convolu¬ 
tion  operation  in  spatial  coordinates,  the  resulting  spatial 
quadratic  phase  in  the  image  of  a  chirped  pulse  can  be 
used  to  gauge  the  amount  of  chirp.  By  longitudinally 
translating  the  output  plane,  a  location  can  be  found 
where  the  wave  focuses  to  its  tightest  spot  size,  eliminat¬ 
ing  the  spatial  quadratic  phase.21  Therefore  as  the  pulse 
chirp  is  increased  (either  positively  or  negatively),  the  lo- 


<8>  exp 


which  corresponds  to  the  convolution  of  the  ultrashort 
pulses  with  a  quadratic-phase  function.  Using  the  ker¬ 
nel  of  free-space  propagation  at  frequency  2a>0, 
exp[j(2  a>0  Ic)(x"2I2Z  )  ] ,  where  Z  is  the  displacement  from 
the  image  plane,  we  seek  the  value  of  Z  that  will  cancel 
the  quadratic-phase  terms  in  Eq.  (39).  This  value  of  Z 
corresponds  to  the  location  where  the  signal  focuses  to  its 
tightest  spot  size.  The  value  of  Z  is  given  by 


O)0 

—  2x" 
Jo> i  \  co1 
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(39) 


Fig.  6.  Superimposed  images  of  square  pulses  generated  by 
varying  the  width  of  a  square  aperture  in  the  spatial  channel. 


This  scaling  rule  is  dependent  on  the  ratio  of  the  frequen¬ 
cies  used  in  the  spatial  and  temporal  channels  of  the 
space-to-time  converter  and  a  factor  of  2  owing  to  the 
doubled-frequency  output  of  the  pulse  imager. 

We  have  verified  experimentally  the  relation  in  Eq.  (40) 
by  finding  the  location  where  the  output  of  the  pulse  im¬ 
ager  focuses,  for  each  translated  location  used  in  the  spa¬ 
tial  channel  of  the  space-to-time  converter  (see  Fig.  7). 
In  our  experiment  the  scaling  rule  reduces  to  1/2  since 
Thus  we  have  a  correspondence  between  the 
longitudinal  translation  of  the  line  source  in  the  input 
spatial  channel  of  the  spatial-temporal  processor  and  the 
longitudinal  translation  of  the  focus  plane  of  the  pulse  im¬ 
ager. 
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5.  CONCLUSION  AND  DISCUSSION 

We  have  introduced  and  experimentally  demonstrated 
femtosecond-rate  space-to-time  conversion  of  amplitude 
and  phase  information  from  a  quasi-monochromatic 
spatial-image  channel  to  an  ultrafast  temporal  waveform. 
Relative  to  other  spatial-temporal  processing  techniques, 
the  CSN  four-wave  mixing  approach  provides 
femtosecond-rate  processing  owing  to  the  fast  bound- 
electron  nonlinearity  and  high  efficiency  on  account  of  a 
relatively  large  *(2)  coefficient.  Such  capabilities  are  re¬ 
quired  for  quantum  control  experiments  or  ultrafast  data 
modulation  in  optical  communications,  where  rapid,  high- 
fidelity  waveform  manipulations  are  required. 

To  model  limitations  imposed  by  phase-matching  con¬ 
siderations  in  the  nonlinear  wave-mixing  process,  we 
have  introduced  a  complex  transfer  function  to  character¬ 
ize  the  system  response.  The  phase  mismatch  in  the 
wave-mixing  process  arises  from  the  broad  temporal-  and 
spatial-frequency  content  of  the  input  temporal-pulse  and 
spatial-image  information,  respectively.  The  transfer 
function  can  be  used  to  compute  the  precise  space-to-time 
converted  waveform  and  assist  in  optimal  selection  of  a 
nonlinear  ciystal  for  real-time  pulse  shaping.  The  con¬ 
version  efficiency  from  the  input  reference  pulse  to  the 
output  synthesized  waveform  was  found  to  depend  on  the 
power  levels  of  the  two  spatial  channels. 

Several  experiments  were  conducted  to  illustrate  the 
ability  to  convert  amplitude  and  phase  information  from 
the  space  domain  to  the  time  domain  with  high  efficiency. 
The  spectrum  reduction  in  the  synthesized  waveform  ow¬ 
ing  to  the  complex  transfer  function  was  observed  and 
was  in  good  agreement  with  the  predicted  spectrum.  All 
the  experiments  were  conducted  on  a  single-shot  basis, 
highlighting  the  benefit  of  processing  with  a  fast  nonlin¬ 
ear  effect. 

The  space-to-time  conversion  process  that  we  have 
demonstrated  uses  an  identical  quasi-monochromatic 
light  source  in  the  two  spatial  channels.  This  require¬ 
ment  is  typical  for  holographic  processing,  where  the  two 
waves  need  to  be  mutually  coherent  during  the  long  re¬ 
cording  phase.  Since  our  processor  is  based  on  a  real¬ 
time  four-wave  mixing  process,  this  requirement  can  be 
alleviated.  The  necessary  condition  for  a  faithful  signal 
conversion  by  nonlinear  wave  mixing  is  that  each  of  the 
spatial  channels  be  implemented  with  a  monochromatic 
light  source,  with  no  temporal  bandwidth.  With  this  con¬ 
dition  the  nonlinear  polarization  term  can  be  trivially  cal¬ 
culated  [as  we  have  done  in  Eqs.  (12)  and  (13)].  It  is 
therefore  feasible  to  perform  wavelength  tuning  of  the 
synthesized  temporal  waveform  by  use  of  different  optical 
frequencies  in  the  two  spatial  channels.  The  center  fre¬ 
quency  of  the  synthesized  temporal  waveform  is  shifted 
by  the  frequency  difference  of  the  two  sources  used  in  the 
spatial  channels.  Since  quasi-monochromatic  light 
sources  are  used  in  reality,  we  require  that  the  coherence 
time  of  the  light  sources  be  much  longer  than  the  time 
window  of  the  SPD  (which  is  a  very  simple  requirement  to 
satisfy).  The  propagation  directions  of  the  fields  from  the 
two  spatial  channels  may  require  adjustment,  such  that 
the  phase-matching  condition  is  satisfied  for  the  two  pro¬ 
cesses  of  the  cascade.  The  hypothesized  wavelength¬ 
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timing  property  of  the  real-time  four-wave  mixing  proces¬ 
sor  is  unique  to  our  space-to-time  technique  and  cannot 
be  accomplished  by  holographic  recording  means. 

APPENDIX  A:  EVALUATION  OF 
PHASE-MISMATCH  TERMS 

The  output  signal  of  the  CSN  process  was  shown  in  Sub¬ 
section  3.B  to  depend  on  the  phase-mismatch  terms  in  the 
coupled-wave  equations  characterizing  the  generation  of 
the  new  fields.  The  phase  mismatch  occurs  since  the 
phase-matching  requirement  for  efficient  energy  transfer 
cannot  be  supported  for  the  temporal  bandwidth  of  the 
SDW  and  the  angular  bandwidth  of  the  spatial- 
information  channel.  In  this  appendix  we  derive  the 
phase-mismatch  terms  for  the  two  processes  of  the  cas¬ 
cade,  using  the  information-channel  model  of  a  delta  func¬ 
tion  shifted  by  an  amount  £  The  phase-mismatch  terms 
obtained  in  this  appendix  are  inserted  into  Eq.  (29),  de¬ 
scribing  the  complex  transfer  function  of  the  processor. 

The  first  nonlinear  interaction  of  the  cascade,  the 
frequency-sum  process,  generates  the  intermediate  wave, 
U-mt.  (Note  that  all  properties  of  the  waves  inside  the 
crystal  are  with  the  hat  overscript. )  The  phase  mismatch 
of  the  first  process  of  the  cascade  is  defined  by  the  vector 
equation,  Eq.  (27a),  where  the  phase  mismatch  must  lie 
in  the  z  direction  owing  to  the  kinematic  condition  (con¬ 
servation  of  momentum  in  the  x '  direction).  Since  the  in¬ 
teracting  waves  are  propagating  at  very  small  angles 
relative  to  the  z  axis,  we  can  use  the  paraxial  approxima¬ 
tion  (note  that  Fig.  2  shows  greatly  exaggerated  angles), 
simplifying  the  phase-mismatch  expression  to  the  sum  of 
the  ^-vector  magnitudes,  i.e.,  Akn)(<oy  f)  ss  k^co) 
+  k2(()  “  £)*  We  next  find  these  ^-vector  mag¬ 

nitudes  for  evaluation  of  the  first  phase-mismatch  term. 

The  first  input  wave,  Ui ,  from  the  input  temporal 
channel,  is  polarized  parallel  to  the  crystal’s  ordinary 
axis.  Therefore  the  wave-vector  magnitude  is  indepen¬ 
dent  of  its  propagation  direction  and  is  given  by  ki(co) 

k(o)0)  4-  ( dk/d<o)(a)  -  o}0).  To  find  the  wave-vector 
magnitude  of  the  intermediate  wave,  ,  and  its  propa¬ 
gation  direction,  we  apply  the  kinematic  condition  using 
the  paraxial  approximation,  yielding 

*sat(«.  f)  =  *l(®)*l(*>)  +  *2(0$2(f> 

=  k1(o>)61  +  M2(f),  (Al) 

where  the  second  equality  results  from  invoking  the  kine¬ 
matic  condition  on  the  input  fields  across  the  crystal  in¬ 
terface.  (Before  refraction  into  the  crystal,  $x  is  not  fre¬ 
quency  dependent,  as  all  the  components  propagate  in  the 
same  direction  [see  Eq.  (22)],  and  the  magnitude  of 
does  not  vary  with  the  shift  of  in  £,  as  it  is  a  quasi- 
monochromatic  wave.)  To  analyze  the  effect  of  varying 
the  parameters  Aco  =  to  ~  ooq  and  £  we  perform  a  small- 
signal  perturbation  about  w  =  o>0  and  £  =  0  by  defining 
the  following: 

^int(^>  £)  ^int  A^int. 
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ki(co) 


(Oq  A  (Oq 


k2(e2)h  = 


d  t 

*2(f)  =  f*  (A2) 


where  k^t  and  bmt  are  the  & -vector  magnitude  and  angle 
of  propagation,  respectively,  for  the  intermediate  wave 
evaluated  at  co  =  o>0  and  £  =  0.  Inserting  the  perturbed 
expressions  into  Eq.  (Al),  neglecting  second-order  terms 
of  the  form  A^^A  $mt ,  and  simplifying  the  expression  by 
removing  the  constant  terms  that  satisfy  the  equality  on 
both  sides  of  the  equation  yields 
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where  we  used  01  =  -D/2f  and  k2  =  Wl/c.  We  can 
further  simplify  the  expression  by  setting  Ak^t 
~  (dkint/d(*>)Aa)  +  (dkint/d6)A$int)  resulting  in 
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where  we  used  the  definition  of  the  group  velocity,  vg  int 

=  (dk^J du))~l.  Using  this  result,  we  can  express  the  k 
vector  of  the  intermediate  wave  as 
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The  term  akirt/ 36  is  evaluated  by  [  ( w()  +  a> }  )/c  ] 
X  ( 3n[u°+0Jl)l  36),  where  the  rate  of  change  of  the  extraor¬ 
dinary  refractive  index  at  angle  6mt  can  be  found  by  dif¬ 
ferentiating  the  equation  governing  the  refractive-index 
dependence  on  angle  6  of  an  anisotropic  crystal.26 

Next,  we  find  the  ^-vector  properties  of  the  wave  from 
the  spatial-information  channel,  U2 ,  which  is  also  polar¬ 
ized  in  the  direction  of  the  extraordinary  axis.  Since  this 
wave  is  quasimonochromatic,  the  propagation  direction 
inside  the  crystal  determines  the  wave-vector  magnitude. 
We  find  the  propagation  direction  by  using  the  kinematic 
condition  across  the  crystal  interface  together  with  a 
small-signal  perturbation,  yielding 


"l 

s  —  (n2  +  An2)(62  +  A  62) 


c  \2 f  fj’ 
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where  d2  is  the  angle  between  the  propagation  direction 
and  the  z  axis  for  no  spatial  shift  of  the  delta  function. 
By  neglecting  second-order  terms  of  the  form  An2A  ~62, 
and  expressing  the  index  change  as  An2 
=  (dn'e°'1>/d6)A62,  we  find 
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where  the  derivative  of  the  refractive  index  can  again  be 
calculated.26  The  k  vector  of  the  field  is  given  by  a> j(n2 
+  A n2)/c,  yielding 


Fi&*  S.  Illustration  of  phase  mismatch  owing  to  broad  temporal 
bandwidth  (spatially  dispersed  along*')  and  angular  bandwidth. 
The  vector  sum  of  the  two  fundamental  waves  generates  a 
6-vector  front  that  is  compared  with  the  available  6-vector  front 
of  the  frequency-sum  wave.  Exact  phase  matching  is  possible  at 
each  temporal  frequency  with  different  components  of  the  spatial 
angular  bandwidth. 
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The  phase  mismatch  of  the  first  process  of  the  cascade  is 
therefore  found  by  combining  the  expressions  for  the 
wave-vector  magnitudes,  resulting  in 


A*(1>(<o,  f) 

=  ki(a>)  +  k2(i)  -  ^int (*>>  f) 

=  - 
c  0  c 


int) 


+  vg„  1  Vg-  int  + 


3kmt2fc 


+  (dkjdd)  A04,  neglecting  second-order  terms  of  the  form 
A/;  4  A  ,  and  simplifying  the  resulting  expression,  we  ob¬ 
tain 
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The  similarity  between  the  expressions  describing  the 
perturbation  angles  A^t  and  A  04  [Eqs.  (A4)  and  (All)] 
arises  from  the  identical  dependence  on  the  variational 
parameters  on  the  right-hand  sides  of  Eqs.  (Al)  and 
(A10).  The  wave-vector  magnitude  of  the  output  field  is 
expressed  by 
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=  Afe(1)(co0,  0)  + 
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The  constant  term  of  the  phase  mismatch  is  set  to  zero  by 
proper  crystal  selection,  cut,  and  orientation  set  to  satisfy 
the  phase-matching  condition  for  the  center  frequency  co0 
and  without  spatial  offset  £.  Figure  8  graphically  illus¬ 
trates  the  difference  between  the  vector  sum  of  k±(w) 
(spatially  dispersed  in  the  x'  direction)  and  k2(£)  (broad 
spatial  bandwidth  at  every  location)  and  ki nt(co,  f)  (avail¬ 
able  wave  vectors  that  the  crystal  supports). 

The  calculation  of  the  phase-mismatch  term  is  repeated 
for  the  second  interaction  of  the  cascade,  the  frequency- 
difference  process,  which  generates  the  output  wave,  U 4. 
By  reasoning  of  the  paraxial  approximation  the  phase 
mismatch  of  the  second  process  of  the  cascade  is  again  an 
equation  with  wave-vector  magnitudes,  i.e.,  A&(2)(co,  f) 
=  k3  +  k4((o9  £)  -  kmt(<x),  f).  The  magnitude  of  the 
wave  vector  from  the  second  spatial  channel  is  co^n^Vc, 
as  the  quasi-monochromatic  wave  is  polarized  in  the  di¬ 
rection  of  the  ciystaFs  ordinary  axis.  The  wave-vector 
properties  of  the  output  wave  are  calculated  with  the  ki¬ 
nematic  condition,  yielding 

k4(w,  i)64(<0,  i)  =  *int (<*>,  f)*int («.  0  ^  *3*3 

=  ^i(co)^!  +  k262{£)  ~  k202 ,  (A10) 

where  Eq.  (Al)  was  used  to  simplify  the  expression.  A 
small-signal  perturbation  about  co  -  co0  and  f  =  0  is  per¬ 
formed  on  Eq.  (A10)  to  analyze  the  effect  of  the  temporal 
and  angular  bandwidths.  Using  Ak4  =  (<9&4/<9co)Aco 
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The  phase  mismatch  of  the  second  process  of  the  cascade 
is  therefore  given  by 

A*(2)(co,  f) 

=  kZ  +  k4(<» ,  £)  -  &int(k>,  £) 
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The  constant  term  of  the  phase  mismatch  in  the  second 
process  can  be  shown  to  be  nonzero  by  use  of  the  phase- 
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matching  condition  from  the  first  process  of  the  cascade. 
The  resulting  constant  phase  mismatch  is  given  by 

COn  — 

Ak(2\a>0, 0)  =  —  O'Wo)(04)  -  re'"0)] 

-  yl -  b*“1>].  (A14) 

The  phase  mismatch  is  caused  by  the  crystal’s  birefrin¬ 
gence.  The  two  quasi-monochromatic  spatial  waves, 
which  copropagate  in  free  space,  have  different  propaga¬ 
tion  directions  inside  the  crystal  since  each  wave  experi¬ 
ences  a  different  refractive  index.  Therefore  the  second 
process  cannot  be  phase  matched  for  the  center  temporal- 
and  spatial-frequency  components.  However,  the  loca¬ 
tion  of  the  point  source  in  the  second  spatial  input  chan¬ 
nel  [see  Eq.  (10)]  can  be  slightly  shifted  in  experiment 
from  — Z)/2,  such  that  the  second  process  is  also  phase 
matched  and  high  conversion  efficiency  is  achieved.  This 
results  in  a  small  propagation-angle  offset  between  the 
residual  wave  U i  and  the  output  wave  Z74 ,  which  is  ex¬ 
pected  owing  to  the  kinematic  condition  and  was  observed 
in  our  experiments. 

Evaluation  of  the  various  propagation  angles  inside  the 
eiystal  and  the  crystal  properties  at  these  angles  are  re¬ 
quired  for  determining  Ak(1)  and  Ak(2\  which  are  used  in 
conjunction  with  the  nonlinear  conversion  analysis  of  Sec¬ 
tion  3. 
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The  operational  characteristics  of  a  time-to-space  processor  based  on  three-wave  mixing  for  instantaneous  im¬ 
aging  of  ultrafast  waveforms  are  investigated.  We  assess  the  effects  of  various  system  parameters  on  the 
processor’s  important  attributes:  time  window  of  operation  and  signal  conversion  efficiency.  Both  linear  and 
nonlinear  operation  regimes  are  considered,  with  use  of  a  Gaussian  pulse  profile  and  a  Gaussian  spatial  mode 
model.  This  model  enables  us  to  define  a  resolution  measure  for  the  processor,  which  is  found  to  be  an  im¬ 
portant  characteristic.  When  the  processor  is  operated  in  the  linear  interaction  regime,  we  find  that  the  con¬ 
version  efficiency  of  a  temporal  signal  to  a  spatial  image  is  inversely  proportional  to  the  resolution  measure. 
In  the  nonlinear  interaction  regime,  nonuniform  signal  conversion  due  to  fundamental  wave  depletion  gives 
rise  to  a  phenomenon  that  can  be  used  to  enhanced  the  imaging  operation.  We  experimentally  verify  this 
nonlinear  operation.  ©  2001  Optical  Society  of  America 
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1.  INTRODUCTION 

Ultrafast  temporal  signals  are  difficult  to  acquire,  as  di¬ 
rect  detection  of  the  temporal  information  is  impossible 
owing  to  the  limited  response  time  of  existing  solid-state 
optoelectronic  photodetectors.  Spatial  imaging  of  ul¬ 
trafast  waveforms  offers  an  invaluable  ability,  because 
the  complex-amplitude  temporal  information  is  converted 
to  a  spatial  distribution  for  subsequent  detection  by 
slower,  parallel  optoelectronic  devices.  Information 
about  the  temporal  waveform  envelope  can  be  inferred 
from  the  detected  one-dimensional  spatial  image.  Re¬ 
cently,  a  single-shot  technique  for  time-to-space  conver¬ 
sion  based  on  wave  mixing  spectrally  decomposed  waves 
(SDW’s)  has  been  introduced  and  experimentally 
demonstrated.1  The  time-to-space  conversion  is  gener¬ 
ated  by  three-wave  mixing  with  the  SDW  of  an 
information-carrying  ultrashort  temporal  waveform  and 
an  inverted  SDW  of  a  reference  ultrashort  pulse  in  the 
Fourier  plane  of  a  spectral  processing  device  (SPD)  (see 
Fig.  1).  The  two  SDWs  in  the  Fourier  plane  are  mixed  in 
a  ^<2)  crystal,  generating  a  quasi-monochromatic  wave  at 
a  doubled  frequency.  A  spatial  Fourier  transform  of  the 
generated  second-harmonic  field  to  the  output  plane  pro¬ 
duces  a  spatial  image,  thereby  performing  a  linear  map¬ 
ping  of  the  information  from  the  time-to-space  domain. 
The  novelty  of  this  technique  is  that  the  usual  relation¬ 
ship  between  the  time  window  of  observation  and  the 
physical  aperture  of  the  nonlinear  crystal2’3  is  decoupled 
by  spectral  processing.  Additionally,  the  spatially  dis¬ 
persed  frequency  components  automatically  compensate 
for  group  velocity  mismatch,  permitting  high  conversion 
efficiency.4  Moreover  quadratic  phase  compensation  is 
achieved  by  displacing  the  output  observation  plane,  per¬ 
mitting  chirp  cancellation  in  optical  fiber  transmissions.5 


These  desirable  features  make  this  time-to-space  con¬ 
verter  suited  for  future  ultrafast  optical  communication 
systems.6 

In  this  paper,  we  study  in  detail  the  operation  trade¬ 
offs  of  the  time-to-space  converter.  More  specifically,  we 
investigate  the  relationship  between  the  conversion  effi¬ 
ciency  and  the  observation  time  window.  This  informa¬ 
tion  is  important  in  the  design  of  an  optical  time-division 
multiplexing  system  based  on  ultrafast  pulse  packets. 
We  find  that  the  processor  can  be  set  to  operate  in  the  lin¬ 
ear  regime,  under  weak  three-wave  interaction,  or  in  the 
nonlinear  regime,  where  fundamental  wave  depletion  is 
exhibited.  For  the  linear  regime  we  derive  expressions 
for  the  output  spatial  signal  and  for  the  conversion  effi¬ 
ciency,  finding  that  the  efficiency-resolution  product  of 
the  processor  is  an  invariant  property;  improving  one  pa¬ 
rameter  degrades  the  other.  When  the  reference  pulse 
power  exceeds  certain  power  levels,  the  spatial  frequency 
bandwidth  of  the  second  harmonic  broadens  owing  to  non- 
uniform  signal  conversion.  The  nonlinear  operation  gen¬ 
erates  a  spatial  image  with  a  narrower  spot  size  than  in 
the  linear  regime,  resulting  in  an  enhanced  resolution 
output  image  due  to  the  nonlinear  signal  processing.7 
We  perform  numerical  simulations  for  this  nonlinear  con¬ 
version  case  based  on  a  Gaussian  pulse  model  and  find 
the  required  reference  pulse  power  levels  for  best  tempo¬ 
ral  resolution  and  maximal  conversion  efficiency. 

The  paper  is  organized  as  follows:  In  Section  2  we  es¬ 
tablish  the  Gaussian  beam  profile  and  Gaussian  pulse  en¬ 
velope  model  that  is  used  throughout  this  paper,  where 
we  find  the  relationship  between  the  size  of  the  beam  and 
the  pulse  duration  to  the  resolution  of  the  SPD.  The  lin¬ 
ear  and  the  nonlinear  operating  regimes  of  the  time-to- 
space  conversion  processor  are  investigated  in  Sections  3 
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Input  plane  Reference 


Fig.  1.  Time-to-space  conversion  by  using  wave  mixing  of  in¬ 
verted  spectrally  decomposed  waves  from  a  signal  and  a  refer¬ 
ence  pulse:  Fly  F2,  focal  length. 


and  4,  respectively.  Experimental  results  verifying  our 
analysis  in  terms  of  enhanced-resolution  imaging  are  pre¬ 
sented  in  Section  5,  followed  by  a  conclusion  in  Section  6. 


2.  GAUSSIAN  MODEL  OF  INPUT  WAVES 

The  signal  and  reference  pulses  that  are  used  in  the  time- 
to-space  conversion  are  introduced  to  the  SPD  from  two 
opposite  sides  (see  Fig.  1),  such  that  the  directions  of  their 
spatial  dispersions  will  be  mutually  inverted.  The  beams 
carrying  the  signal  and  reference  pulses  are  incident  on 
the  input  grating  of  the  SPD  at  angles  of +0  and  ~6  rela¬ 
tive  to  the  input  grating  normal,  respectively.  The  angle 
6  depends  on  the  grating  frequency  and  is  chosen  such 
that  the  diffracted  wave  at  the  center  frequency  of  the  ul¬ 
trashort  pulse  is  propagating  along  the  optical  axis  of  the 
SPD.  The  diffracted  field  immediately  behind  the  grat¬ 
ing  for  the  signal  pulse  can  be  expressed  as 


y\  t)  ~  ESW 


w 


X  exp(jo>0t)y 


(1) 


where  pit)  is  the  normalized  temporal  envelope  of  the  ul¬ 
trashort  pulse  and  w(x)  and  w{y)  are  the  normalized 
transversal  beam  profiles  in  the  x  and  y  directions  (as¬ 
suming  that  a  separable  function  characteristic  is  appli¬ 
cable),  defining  the  input  pupil  function.  The  param¬ 
eters  t,  Lx ,  and  Ly  characterize  the  pulse  duration  and 
the  size  of  the  input  pupil  function  on  the  grating  in  the  x 
and  the  y  directions,  respectively.  In  general,  Lx 
~~  Ey/cos($)  if  the  input  beam  is  circular,  owing  to  the 
projection  on  the  grating  at  angle  6.  However,  the  beam 
can  be  anamorphically  imaged  on  the  grating  to  achieve 
independent  control  over  Lx  and  Ly .  The  constants  Es , 
a,  c,tQ,  and  co0  are  the  field  strength,  grating  angular  dis¬ 
persion  parameter  [a  =  sin(0)],  speed  of  light  in  vacuum, 
arbitrary  time  shift,  and  center  angular  frequency  of  the 
ultrashort  pulse,  respectively.  Equation  (1)  describes  a 
short  pulse  scanning  across  the  fixed  aperture  at  a  veloc¬ 
ity  of  c/a  in  the  x  direction.  This  velocity  is  greater  than 
the  speed  of  light  in  vacuum  owing  to  the  projection  of  the 
pulse  front  on  a  tilted  plane. 

We  invoke  the  following  two  customary  assumptions8,9 
in  our  study:  temporal  Gaussian  pulse  envelope  and  spa¬ 
tial  Gaussian  beam  profile.  The  Gaussian  beam  profile  is 
commonly  used  for  spatial  confinement  of  propagating 
waves  and  is  a  known  solution  to  the  paraxial  Helmholtz 


equation.10  The  Gaussian  pulse  envelope  is  also  found  to 
be  a  good  descriptor  of  ultrashort  pulses.  We  define  the 
pulse  envelope  and  its  corresponding  temporal  Fourier 
transform  (TFT)  by 


p{t)  -  exp|  -—j  <=>  P((o)  =  V27rexp|  — —j,  (2) 

where  is  the  angular  frequency.  Similarly,  we  define 
the  Gaussian  spatial  beam  profile  and  its  corresponding 
spatial  Fourier  transform  (SFT)  by 


SFT 

w(x)  -  exp(  ~x2)  W(  fx)  =  \lrr exp(-7r2fx2),  (3) 


where  fx  is  the  spatial  frequency.  The  difference  be¬ 
tween  the  TFT  and  SFT  is  due  to  the  inclusion  of  2  it  in 
the  angular  frequency  term  of  the  TFT,  consistent  with 
standard  notation.  Using  the  Gaussian  model  for  the 
spatial  modes  and  temporal  profile,  we  rewrite  Eq.  (1)  as 


Uj n(^,y>0  ~~  EsBXp 


X  exp 


2 


exp(ja>0t). 


(4) 

Let  us  make  the  following  observation  (see  Fig.  2):  The 
width  of  the  spatial  beam  profile  in  the  x  direction  is  pro¬ 
portional  to  Lx .  The  instantaneous  spatial  width  of  the 
pulse  scanning  across  the  grating  is  proportional  to 
\l2rcfa.  The  ratio  of  these  two  widths,  N  a  ccLxI^2tc, 
is  a  measure  of  the  resolution  of  the  SPD  and  describes 
the  number  of  pulses  that  fit  into  the  input  pupil  function. 
Its  significance  is  similar  to  the  space-bandwidth  product 
in  a  traditional  spatial-domain  optical  processing  system 
as  will  be  illustrated  next.  (Note  that  N  is  identical  to 
Weiner’s  resolution  measure,9  expressed  using  the  pa¬ 
rameters  of  our  model.) 

The  input  field  of  Eq.  (4)  undergoes  a  SFT  by  the  first 
lens  of  the  SPD,  with  focal  length  Fx ,  generating  the  field 
in  the  Fourier  plane.  The  expression  is  simplified  by  us¬ 
ing  the  resolution  parameter  Ny  yielding  the  temporal 
field  in  local  spatial  coordinates  u  and  k 


Fig.  2.  Gaussian  beam  mode  and  Gaussian  pulse  envelope  inci¬ 
dent  on  the  diffraction  grating.  The  ratio  of  their  spatial  widths 
determines  the  resolution  of  the  system. 
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A  TFT  can  be  applied  to  Eq.  (5),  revealing  the  spatial  dis¬ 
persion  of  the  field,  given  by 

EsLJLyTrryl^Tr  [  r2 

UsDwiUy  V ;  (O)  =  - — - exp  (1  +  N2) 

A-o^  1  L  ^ 


X  ft)  -  COq  + 
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X  exp  - 


0  aFx  1  +iV2i 
ILw0u\2  1 
l  2  cFl )  1  +  N2. 


exp[-y(^  “  «o)*o]- 


Note  that  optical  fields  in  the  temporal  frequency  domain 
are  denoted  with  a  tilde  overscript,  U.  Equations  (5)  and 
(6)  are  the  time  domain  and  frequency  domain  represen¬ 
tations  of  the  spectrally  decomposed  wave11  (SDW)  at  the 
Fourier  plane.  Some  important  observations  can  be 
made  on  the  properties  of  the  SDW  from  inspecting  Eqs. 
(5)  and  (6):  (i)  From  Eq.  (5)  we  learn  that  the  duration  of 
the  SDW  is  increased  by  a  factor  of  Vl  +  N2.  This  time 
corresponds  to  the  time  of  flight  of  the  pulse  through  the 
aperture  in  the  input  plane  (see  Fig.  2).  (ii)  The  spatial 
dispersion  of  the  temporal  frequency  components  are  ob¬ 
served  in  the  first  Gaussian  function  of  Eq.  (6),  yielding 
da)/du  =  -((oo/aF^NVd  +  AT2).  The  spatial  disper¬ 
sion  is  manifest  in  Eq.  (5)  as  a  Doppler  shift  of  the  tem¬ 
poral  frequency  as  a  function  of  the  spatial  coordinate  u. 
The  Doppler  frequency  shift  arises  from  the  movement  of 
the  input  pulse  (or  light  source)  across  the  diffraction 
grating.  This  movement  also  leads  to  (iii)  The  phase 
fronts  of  the  SDW  field  [Eq.  (5)]  vary  as  a  function  of  time, 
and  the  instantaneous  angle  between  the  wave  vector  and 
the  optical  axis  of  the  SPD  in  the  paraxial  approximation 
is  (c/aFJiN2/!  +  N2){t  -  t0).  (iv)  The  temporal  band¬ 
width  at  any  location  u  is  determined  by  the  first  Gauss¬ 
ian  function  of  Eq.  (6)  and  is  a  (1  +  N2)y2  fraction  of  the 
total  bandwidth.  The  frequency  resolution  can  be  calcu¬ 
lated  by  dividing  the  total  width  of  the  field  [given  by  the 
second  Gaussian  function  of  Eq.  (6)]  by  the  width  for  a 
single  temporal-frequency  component  (given  by  the  first 
Gaussian  function  when  substituting  co  =  <o0,  for  ex¬ 
ample).  This  ratio  is  again  found  to  be  (1  +  N2)V2.  In 
the  limit  of  high-resolution  operation  (i.e.,  N  >  1),  we  will 


observe  that  the  SDW  duration  is  Nr,  the  bandwidth  at 
any  point  has  decreased  by  a  factor  of  N,  and  the  spatial 
resolution  of  frequency  components  is  N.  In  contrast,  if 
we  operate  at  the  low-resolution  limit  (i.e.,  N  <  1),  the 
SDW  duration  will  be  the  same  as  that  of  the  incident 
pulse  (i.e.,  r),  and  there  will  be  no  spatially  varying  Dop¬ 
pler  shift  term  and  no  spatial  resolution  of  the  frequency 
components.  In  this  low-resolution  case,  monochromatic 
diffraction  effects  are  more  pronounced  than  the  band¬ 
width  of  the  optical  source.10 

As  a  last  step,  we  calculate  the  confocal  length  of  the 
SDW,  required  as  an  approximate  limit  to  the  possible  in¬ 
teraction  length  in  the  wave-mixing  process  under  Gauss¬ 
ian  beam  focusing.  The  confocal  length  is  calculated 
from  the  spatial  width  (or  waist)  of  the  SDW  [from  Eq. 
(5)].  The  field  distribution  obeys  an  independent  Gauss¬ 
ian  mode  for  the  u  and  v  directions  or  an  elliptical  cross 
section.  We  find  that  the  two  independent  confocal 
lengths  for  the  u  and  the  v  directions  are  2 Z0u 

=  2n„/12\0(l  +  X2)IttL2  and  2Z0v  =  2na,0F12\0/ 

TiLy2,  respectively,  where  n^o  is  the  refractive  index  of 
the  crystal  at  frequency  <x>0 .  If  one  of  the  confocal 
lengths  is  significantly  smaller  than  the  other,  then  it 
serves  as  the  limit  to  the  interaction  length.  Usually 
Lx  Lv  and  N  >  1,  resulting  in  2 Z0v  as  the  limiting  in¬ 
teraction  length.  As  the  size  of  the  input  pupil  is  in¬ 
creased,  the  confocal  length  is  decreased. 

The  characteristics  of  the  spectrally  decomposed  waves 
are  required  in  our  analysis  of  the  time-to-space  conver¬ 
sion,  which  is  evaluated  in  Sections  3  and  4  in  the  linear 
and  the  nonlinear  regimes. 


3.  LINEAR  TIME-TO-SPACE  CONVERSION 

In  this  section  we  analyze  the  wave-mixing  process  and 
the  dependence  of  the  spatial  output  signal  on  the  param¬ 
eters  of  the  SPD  in  the  linear  regime,  where  the  funda¬ 
mental  input  waves  are  nondepleting.  When  this  as¬ 
sumption  is  used,  the  SDW  of  the  signal  and  reference 
pulses  are  completely  characterized  and  we  may  find  the 
nonlinear  polarization  term  that  arises  in  the  nonlinear 
crystal.  Armed  with  the  nonlinear  polarization,  we 
solve  the  wave  equation  for  the  generated  quasi- 
monochromatic  third  wave,  whose  spatial  frequency 
modulation  carries  the  temporal-frequency  information  of 
the  signal  pulse.  This  resultant  wave  undergoes  a  SFT 
to  the  output  plane  of  the  SPD,  giving  rise  to  a  spatial  sig¬ 
nal  that  contains  the  desired  temporal  location  informa¬ 
tion  t0  (see  Fig.  1).  Finally,  we  calculate  the  total  energy 
that  has  been  converted  to  the  spatial  domain,  from 
which  we  can  find  the  conversion  efficiency  from  the  input 
temporal  pulse  to  the  output  spatial  signal.  This  ap¬ 
proach  differs  from  Weiner’s  conversion  efficiency  estima¬ 
tion,  which  is  based  on  the  SDW^  averaged  power 
density4  (averaged  in  duration  and  distribution).  We  be¬ 
lieve  that  our  approach  is  justified  since  the  conversion  is 
produced  by  a  fast  parametric  point  process,  and  there¬ 
fore  one  should  take  into  account  the  temporal  dynamics 
and  field  distribution  of  the  SDWX  as  evaluated  in  Sec¬ 
tion  2. 
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A.  Nonlinear  Polarization 

The  two  mutually  inverted  SD Ws  of  the  signal  pulse  and 
the  reference  pulse  interact  by  using  a  three-wave-mixing 
process  in  a  crystal  exhibiting  a  second-order  nonlinear¬ 
ity.  Since  we  are  operating  in  the  linear  interaction  re¬ 
gime,  the  analysis  can  be  performed  in  the  temporal  fre¬ 
quency  domain.  Therefore  the  nonlinear  polarization 
can  be  expressed  as  a  convolution  of  the  two  input  waves 
in  the  temporal  frequency  domain.  The  SDW  of  the  sig¬ 
nal  pulse  in  the  temporal  frequency  domain  is  given  by 
Eq.  (6).  The  reference  pulse  generates  a  similar  SDW 
with  an  inverted  spatial  dispersion  sign.  Consequently, 
we  can  express  the  nonlinear  polarization  term  as12 

P^XUy  v ,  z;  a>) 

“  [  ^sdw(^>  v\  x*j)Usdw(u>  O)  -  nr) 


x  exP[j[&s(^)  +  kR(co  -  tzr)]z]dts,  (7) 

where  is  the  nonlinear  optical  coefficient,  which  is  as¬ 
sumed  to  be  constant  across  the  interaction  bandwidth  as 
a  nonresonant  nonlinearity  is  used,  S  and  R  denote  the 
signal  and  the  reference  SDW  waves,  ks(  •)  and  kR(  •)  are 
the  wave-vector  magnitudes  as  a  function  of  the  temporal 
frequency,  and  the  integration  variable  w  has  units  of 
temporal  frequency.  We  express  the  wave-vector  depen¬ 
dence  on  a  temporal  frequency  variable  ft  by  using  a  Tay¬ 
lor  series  expansion  about  ft  =  co0 ,  yielding 

dk(ft) 

ks{w)  =  k((o0)  4*  (t*t  “  (Dq) - — 

dft 

“_wo 
dk(Cl) 

—  w)  =  k((Oo)  +  (w  -  VJ  —  (Oq ) - 

dft 

“~w0 

(Note:  we  assume  a  type  I  interaction  where  the  two  fun¬ 
damental  waves  are  polarized  parallel  to  the  crystal’s  or¬ 
dinary  axis.  Therefore  the  Taylor  coefficients  for  the  two 
expansions  are  identical.)  When  we  express  the  sum  of 
the  two  k  vectors  by  using  only  the  first  two  terms  of  the 
series  expansion,  the  resultant  can  be  shown  to  be  inde¬ 
pendent  of  tu  and  can  therefore  be  taken  out  of  the  inte¬ 
gral.  The  nonlinear  polarization  can  therefore  be  ap¬ 
proximated  by 

v,z;  (O) 


=  2e0deffexp 


2k(a>0) 


+  k(aj  —  2(x)q) 


dk(fl) 

dft 


G(u,v;a>)9  (8) 


where 

G(uf  v,  co)  —  I  &sdw(^>  'w)£7guw(w>  v\  to  —  Tn)dtj 

J  -00 

is  the  convolution  operation  in  the  temporal  frequency  do¬ 
main. 


B.  Generated  Wave 

The  nonlinear  polarization  of  Eq.  (8)  serves  as  the  driving 
force  for  the  wave  equation  of  the  generated  third  wave, 
governed  by12 

d  ~  j  c 02/jLq 

^sum( u  >  V*  z  j  a))  ~  ~  Vy  Z\  (o) 

dz  Z  fe{(o) 

X  exp[  —jk sum( <o)z],  (9) 

which  is  the  differential  equation  obtained  from  the 
monochromatic  plane-wave  solution,  when  the  slowly 
varying  envelope  and  nondepleting  pump  approximations 
are  used.  We  next  expand  ksum((o)  in  a  Taylor  series 
about  fl  =  2co0,  keeping  only  the  first  two  terms  of  the 
expansion.  This  procedure  is  carried  out  for  the  phase 
term  only,  because  it  will  oscillate  rapidly  with  propaga¬ 
tion  along  z.  Using  the  Taylor  approximation  and  insert¬ 
ing  the  nonlinear  polarization  of  Eq.  (8)  into  Eq.  (9)  yields 

d  ~ 

7“^sum(u? 

dz 


2o)0 

~  J - deffG(u,  v;  (o) 

cn2a>0 


X  exp[-y[&(2o>0)  -  2&(fc>0)]2]exp{-y(<l)  -  2 <o0) 

x  [{dk/dn)\^r(dk/da)\n^o]z}.  do) 

We  assume  that  the  phase-matching  condition,  k(2<o0 ) 
=  2k((ti0)}  can  be  satisfied  by  proper  crystal  selection 
and  orientation.  Equation  (10)  can  be  integrated  from  0 
to  Lc ,  the  length  of  the  crystal,  assuming  a  zero  initial 
state  [i.e.,  Usum(u,  v ,  0;  <o)  =  0].  The  field  outside  the 
output  face  of  the  crystal  is  given  by 
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with  A( co)  defined  by 


A((o)  =  exp 


(<o  -  2 (o0)2r2  1  +  N2 
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{(x)  -  2w0)« 


PLC 


X  exp 
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where  0  =  (dk/dfl) |a=2ft>o  -  (dk/dfl) |ft=W()  is  the  inverse 
group  velocity  mismatch  and  we  used  the  explicit  solution 
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of  the  convolution  G(u,  v;  w)  in  deriving  Eq.  (11).  We 
observe  that  the  spatial  dependence  of  the  output  signal 
has  been  decoupled  from  the  temporal-frequency  variable. 
This  is  a  fundamental  property  of  wave  mixing  two  mu¬ 
tually  inverted  SOW’S.  The  spatial  dependence  in  Eq. 
(11)  contains  the  following  components:  (i)  a  confined 
spatial  extent  prescribed  by  the  spatial  Gaussian  func¬ 
tions,  (ii)  an  attenuation  factor  with  Gaussian  depen¬ 
dence  on  the  time  delay  t0  between  the  signal  and  the  ref¬ 
erence  pulses,  and  (iii)  a  spatial  linear  phase  term  that 
carries  the  timing  information  t0 .  The  temporal- 
frequency  dependence  of  A(o>)  consists  of  the  generated 
Gaussian  bandwidth  multiplied  by  a  spectral  filter  in  the 
form  of  the  sine  function.  The  generated  sum  wave  can 
also  be  expressed  in  the  time  domain  when  an  inverse 
TFT  is  performed,  which  would  result  in  the  replacement 
of  A(o))  in  Eq.  (11)  by  a(t ),  given  by 


output  values  to  the  spot  size  reduces  exactly  to  N \  the 
resolution  of  the  SPD.  This  result  is  consistent  with  our 
linear  conversion  regime,  illustrating  that  the  space- 
bandwidth  product  is  preserved.  We  next  analyze  the 
conversion  efficiency  of  the  processor. 

D.  Conversion  Efficiency 

The  time-to-space  processor  functions  in  real  time,  gener¬ 
ating  the  time-varying  output  spatial  field.  However,  all 
detection  devices  respond  to  the  optical  intensity.  The 
time-dependent  intensity  is  calculated  from  Eq.  (14), 
where  the  amplitudes  of  the  signal  and  reference  pulses 
have  been  expressed  by  their  relationship  to  the  signal 
and  the  reference  pulse  energies  £s  and  Sr ,  respectively 
(see  Appendix  A)  along  with  the  definitions  relating  the 
refractive  index,  the  electric  permeability,  and  the  speed 
of  light  in  vacuum,  yielding 


Since  the  TFT  of  a  product  is  a  convolution,  a(t )  is  defined 
by  a  convolution  between  Gaussian  and  rectangular  tem¬ 
poral  envelopes  with  a  center  frequency  of  2o)0.  In  the 
case  that  j3Lc  <  tN,  spectral  filtering  due  to  group  veloc¬ 
ity  mismatch  will  be  negligible  and  the  spectrum  A(w) 
and  the  temporal  profile  a(t)  will  be  Gaussian.  In  con¬ 
trast,  when  /3LC  §>  rN,  the  spectrum  will  have  a  sine  de¬ 
pendence  due  to  phase-matching  considerations  and  the 
temporal  profile  will  be  rectangular. 


C.  Output  Spatial  Signal 

The  field  at  the  output  plane  of  the  SPD  is  evaluated  by 
the  SFT  of  the  generated  field  from  the  three-wave- 
mixing  process.  We  express  this  output  field  in  the  time 
domain,  yielding 
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(14) 


The  last  two  Gaussian  terms  in  Eq.  (14)  characterize  the 
spot  size  of  the  output  spatial  signal.  The  location  of  this 
Gaussian  signal  is  determined  by  the  temporal  informa¬ 
tion  /0 ,  weighted  by  the  SPD  parameters.  This  term  rep¬ 
resents  the  time-to-space  converted  signal.  The  ampli¬ 
tude  of  the  output  signal  is  weighted  by  the  Gaussian 
time  window  of  the  SPD,  limiting  the  possible  values  of 
t0.  The  resolution  of  the  time-to-space  converter  is  de¬ 
termined  by  these  two  parameters.  It  can  be  shown  that 
the  ratio  of  the  spatial  widths  of  the  observed  range  of 


with  a  change  of  integration  variable  ft  =  <o  -  2<u0. 
The  output  energy  can  be  expressed  in  analytic  form  for 
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two  limiting  cases,  when  the  spectral  filtering  is  negli¬ 
gible  (/3LC  <€  tN)  or  dominant  (/?LC  tN). 

In  the  limit  of  a  high-resolution  SPD  (i.e.,  a  large  pupil 
mode  size,  leading  to  N  >  1),  which  is  usually  the  case  of 
interest,  the  Gaussian  spectrum  will  be  much  narrower 
than  the  sine  function.  We  find  that  the  output  energy  is 
given  by 


font  — 


d2 


eff 


212/27T9/%£r- 


V  LJL, 

e0c  k(j4F12  tN2 


x  exp 


2  r2iV2/‘ 


(18) 


The  actual  interaction  length  in  the  crystal  Lc ,  is  deter¬ 
mined  by  the  minimum  of  either  one  of  the  two  confocal 
lengths  of  the  SDW  or  the  actual  length  of  the  crystal. 

The  output  energy  can  also  be  evaluated  for  the  case  of 
a  very  small  pupil  mode  size.  In  this  case,  spectral  filter¬ 
ing  due  to  the  group  velocity  mismatch  will  be  predomi¬ 
nant  in  Eq.  (17).  This  extreme  case  is  of  no  interest  to  us 
because  there  is  no  time-to-space  conversion. 


E.  Efficiency-Resolution  Product 

Let  the  conversion  efficiency  be  defined  as  77  =  £out/£s , 
the  total  energy  converted  from  the  time-to-space  do¬ 
mains.  We  can  manipulate  Eq.  (18)  to  express  the  prod¬ 
uct  of  the  conversion  efficiency  and  N,  the  resolution  of 
the  SPD.  After  simplifying  we  find  that 
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eff 
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Thus  the  efficiency-resolution  product  is  invariant  to  Lx . 
One  parameter  can  be  improved  at  the  expense  of  the 
other,  under  the  assumption  that  Lc  is  less  than  the  con- 
focal  distance.  The  implication  of  this  result  is  that  the 
performance  can  be  improved  by  increasing  Ly  and/or  de¬ 
creasing  F 1 ,  resulting  in  tighter  focusing  of  the  SDW. 
However,  this  leads  to  a  reduction  in  the  confocal  length. 

In  the  limiting  case  of  confocal  focusing,  where  we 
choose  the  interaction  length  to  equal  the  confocal  length 
in  the  vertical  direction  as  defined  in  Section  2  (because  it 
is  shorter  than  the  confocal  length  in  the  horizontal  direc¬ 
tion  when  Lx  ~  Ly),  we  find  that 


yN  = 


<*eff 
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It  is  possible  to  define  /#  —  Fi  fLv  ,  and  conclude  that  the 
performance  of  the  time-to-space  converter  improves  as 
/*#  increases.  Since  Ly  Lx  and  N  «  Lx,  for  a  large 
time  window  to  be  converted  to  a  space-domain  image 
with  a  confocal  crystal  length,  a  severe  conversion  effi¬ 
ciency  penalty  must  be  paid. 

The  invariance  of  the  efficiency-resolution  product  to 
Lx  could  be  intuitively  explained  by  the  following  reason¬ 
ing:  The  duration  of  the  input  field  to  the  SPD  is  propor¬ 
tional  to  Lx  (input  spatial  mode),  while  the  field  ampli¬ 
tude  is  proportional  to  1A/lJ  (see  Appendix  A).  The 
corresponding  SDW  has  the  same  dependence  on  Lx  for 
both  amplitude  and  duration  parameters.  The  ampli¬ 
tude  of  the  generated  second-harmonic  field  under  the  lin¬ 


ear  conversion  regime  is  therefore  proportional  to  ULX , 
while  the  duration  is  still  proportional  to  Lx .  The  power 
of  the  generated  field  is  proportional  to  1 !L2  with  no 
change  in  the  duration.  Consequently,  the  energy  of  the 
spatial  signal  is  proportional  to  1 1LX ,  as  is  the  conversion 
efficiency.  Since  the  resolution  N  is  proportional  to  Lx , 
the  efficiency-resolution  product  is  invariant  to  Lx . 

The  expressions  derived  in  this  subsection  are  valid  for 
weak  interaction  leading  to  the  linear  conversion  behav¬ 
ior.  However,  in  many  practical  cases  of  interest,  the 
analyzed  ultrafast  signal  is  much  weaker  than  the  refer¬ 
ence  pulse.  While  this  arrangement  can  lead  to  a  high 
conversion  efficiency,  it  may  violate  our  basic  assumption 
of  linear  interaction  with  nondepleting  fundamental 
waves.  Therefore  a  complete  analysis  should  allow 
fundamental  wave  depletion,  and  it  is  conducted  in  Sec¬ 
tion  4. 


4.  NONLINEAR  TIME-TO-SPACE 
CONVERSION 

The  three-wave-mixing  process  of  the  time-to-space  con¬ 
verter  is  reexamined  in  this  section  under  the  assumption 
of  a  strong  reference  pulse  interacting  with  a  weak  signal 
pulse.  The  reference  pulse’s  SDW  is  considered  to  be 
nondepleting,  while  the  SDW  of  the  signal  pulse  and  the 
generated  quasi-monochromatic  second-harmonic  wave 
will  experience  periodic  power  exchanges.  Owing  to  the 
temporal  dynamics  of  the  interacting  waves,  the  analysis 
needs  to  be  performed  in  the  time  domain  (as  opposed  to 
the  temporal  frequency  domain  of  Section  3),  requiring 
the  introduction  of  a  time  derivative  to  the  coupled- wave 
equations.  We  resolve  this  complication  by  restricting 
our  solution  to  the  high-resolution  case,  where  the  tempo¬ 
ral  bandwidth  of  the  waves  at  any  spatial  location  de¬ 
creases  and  approaches  a  quasi-monochromatic  wave. 
Thus  we  can  use  the  quasi-static  solution  to  solve  the 
wave-mixing  process,  ignoring  the  effect  of  group  velocity 
mismatch  (more  thorough  justification  provided  in  Ap¬ 
pendix  B). 


A.  High-Resolution  Spectrally  Decomposed  Waves 
A  general  expression  for  the  signal  pulse’s  SDW  in  the 
high-resolution  limit  can  be  evaluated  by  the  SFT  of  Eq. 
(1),  yielding 

K  t) 
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c(t  —  /q) 
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x  P 


I  UTO)0\ 

co0u 
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-  '•> 

exp(y*>00, 


(21) 


where  W(  )  and  P(«)  are  the  SFT  and  the  TFT  of  u>(  •)  and 
P  ( * ) ,  respectively.  In  our  limiting  case  of  high-resolution 
SDW,  the  spatial  extent  in  the  *  direction  of  the  moving 
ultrashort  pulse  in  Eq.  (1)  is  much  smaller  than  the  size 
of  the  beam.  Thus,  in  deriving  Eq.  (21),  we  simplified  the 
Fourier  integral  by  sampling  the  spatial  mode  at  the  lo¬ 
cation  of  the  short  pulse  and  taking  the  aperture  effect 
out  of  the  integral.  The  temporal  frequency  dependence 
of  the  SDW,  evaluated  by  the  TFT  of  Eq.  (21),  is  given  by 
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The  term  mixing  the  temporal  frequency  components 
with  the  spatial  coordinate  u  in  the  SFT  of  the  input  ap¬ 
erture  characterizes  the  spatial  dispersion.  Since  the 
beam  aperture  Lx  is  large  in  the  high-resolution  case,  its 
SFT  is  a  very  narrow  function.  Its  width  determines  the 
temporal  frequency  overlap  or  bandwidth  Ao>  at  any  point 
on  the  spatial  coordinate  u.  The  center  temporal  fre¬ 
quency  component  at  the  spatial  coordinate  u  for  the  sig¬ 
nal  SDW  is  given  by 
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where  Sws  and  Scor  are  frequency  terms  that  are  con¬ 
tained  in  the  bandwidth  A co  at  each  spatial  location  for 
the  signal  and  reference  SDW.  Equation  (24)  states  that 
the  phase  mismatch  is  bound  by  the  product  of  the  in¬ 
verse  group  velocity  mismatch  ft  (defined  in  Subsection 
3.B)  and  the  bandwidth  Aco  at  any  point  on  the  spatial  co¬ 
ordinate  u.  Since  the  bandwidth  is  inversely  propor¬ 
tional  to  Lx ,  the  phase  mismatch  can  be  reduced  by  in¬ 
creasing  Lx  until  the  effect  of  this  phase  mismatch  can  be 
ignored  (see  Appendix  B).  In  deriving  the  phase  mis¬ 
match  bound  of  Eq.  (24),  we  assume  that  the  constant 
term  of  the  phase  mismatch  is  eliminated  by  proper  crys¬ 
tal  orientation,  set  to  satisfy  the  phase -matching  condi¬ 
tion.  Additionally,  owing  to  the  opposite  spatial  disper¬ 
sion  directions,  wsig  +  <°ref~  2wo  [see  Eqs.  (23a)  and 
(23b)] .  Finally,  So)s  and  $<or  are  bound  on  ( -  A  w/2,  A  co/2) 
such  that  their  sum  is  always  less  than  or  equal  to  |Ao>|. 


U  O)0 

tosig=  toQ  —  •  (23a) 

The  reference  pulse  SDW  can  be  described  by  expressions 
similar  to  Eqs.  (21)  and  (22),  yet  with  an  opposite  spatial 
dispersion  direction.  Its  temporal  frequency  to  spatial 
coordinate  relationship  is  therefore  given  by 


UO)0 

wref  =  .  (23b) 

We  next  calculate  the  phase  mismatch  of  the  wave-mixing 
process  at  an  arbitrary  spatial  location  u  due  to  the  tem¬ 
poral  frequency  bandwidth. 

B.  Phase  Mismatch  in  Wave-Mixing  High-Resolution 
Spectrally  Decomposed  Waves 

The  phase  mismatch  between  the  generated  spatial  wave 
and  the  two  fundamental  SDW^s  at  any  spatial  location  is 
a  function  of  the  wave-vector  dependence  on  temporal  fre¬ 
quency.  We  express  this  dependence  by  a  Taylor  expan¬ 
sion  about  ct>0  for  the  fundamental  SDW  and  about  2co0 
for  the  generated  second-harmonic  wave,  as  has  been 
done  in  the  linear  analysis  case.  When  the  Taylor  expan¬ 
sions  are  used,  the  phase  mismatch  is  given  by 
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C.  Solution  of  Nonlinear  Wave-Mixing  Spectrally 
Decomposed  Waves 

By  increasing  the  SDW  resolution,  both  the  bandwidth 
and  phase  mismatch  at  each  location  are  reduced,  such 
that  we  can  assume  that  in  the  high-resolution  limit  the 
waves  are  essentially  quasi-monoehromatic  and  the  three 
interacting  waves  are  phase  matched.  Thus  the  nonlin¬ 
ear  wave-mixing  problem  can  be  treated  with  the  quasi¬ 
static  approximation,  with  the  differential  equations 
characterizing  the  three  wave  mixing  defined  by12 

d  c 
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dz 
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cn2(o0 

where  we  assume  that  the  SDW  arising  from  the  refer¬ 
ence  pulse  £/§dw( u ,  v;  t)  is  nondepleting.  This  set  of 
coupled  equations  resemble  those  of  the  frequency-sum 
process  with  monochromatic  plane  waves,  and  their  solu¬ 
tion  is  known  to  be  oscillatory  with  respect  to  z.  We  ap¬ 
ply  the  oscillatory  solution  with  zero  initial  condition  for 
the  generated  sum  wave,  yielding 
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One  can  make  a  small-signal  approximation  (i.e.,  sin(x) 
=  x],  yielding  the  results  for  the  linear  time-to-space  con¬ 
version  process  in  the  case  of  negligible  spectral  filtering 
(or  high  resolution).  A  general  form  of  the  SDW,  see  Eq. 
(21),  or  a  Gaussian  pulse  and  Gaussian  spatial  mode 
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model,  see  Eq.  (5)  with  N  >  1,  can  be  used  in  Eq.  (26)  to 
characterize  the  generated  wave.  However,  the  nonlin¬ 
ear  relationship  on  the  amplitude  of  the  reference  SDW 
prohibits  continuing  the  analysis  to  the  output  plane  of 
the  processor.  Moreover,  the  reference  SDW  amplitude 
is  time  varying  according  to  the  spatial  mode  of  the  input 
beam.  A  computation  program  can  calculate  the  SFT  of 
the  field  at  the  output  of  a  crystal  of  length  Lc  at  all  times 
t  to  generate  the  field  at  the  observation  plane.  With  the 
time-varying  field,  the  detected  instantaneous  intensity 
can  be  computed  as  well  as  the  total  energy  conversion 
from  the  input  signal  pulse  energy  to  the  output  spatial 
wave  energy. 

However,  we  can  calculate  the  conversion  efficiency 
from  Eq.  (26).  Using  the  relationship  relating  field  and 
intensity,  we  find  that 


~  2/fDW(«,  v ,  0;  f)sin2j  2*77 


f  4fsDw(u>  v\  0 


(27) 

For  total  conversion  efficiency,  Eq.  (27)  needs  to  be  inte¬ 
grated  over  the  spatial  and  the  temporal  extent.  The  fac¬ 
tor  of  2  arises  from  an  equal  energy  contribution  from  the 
reference  SDW  for  every  converted  photon  from  the  signal 
SDW. 


D.  Modeling  and  Numerical  Solution 

To  characterize  the  nonlinear  time-to-space  conversion,  a 
program  for  calculating  the  output  spatial  signal  and  its 
accompanying  properties  was  employed,  by  using  a 
Gaussian  pulse  envelope  and  a  rectangular  beam  profile. 
The  rectangular  window  eliminates  the  temporal  varia¬ 
tions  of  the  reference  pulse  amplitude,  simplifying  the 
calculations.  Since  we  are  interested  in  qualitative  in¬ 
sight  into  the  nonlinear  conversion  effect,  we  computed 
the  output  spatial  signal  from  a  generated  signal  of  the 
form  exp(  -;r )sin[i£  exp(  -x2)].  The  parameter  K  ac- 
counts  for  the  amplitude  of  the  nonlinear  optical  co¬ 
efficient  d  eff ,  and  the  other  constants  such  as  the  crystal 
length  and  refractive  indices  in  Eq.  (26).  The  magnitude 
squared  of  the  Fourier  transform  of  the  output  field  rep¬ 
resents  the  output  intensity  distribution  observed  in  the 
time-to-space  conversion. 

The  results  of  the  simulation  (see  Fig.  3)  illustrate  the 
following:  For  small  values  of  K  the  sine  function  be¬ 
haves  linearly  and  the  generated  wave  is  weak.  Its 
shape  is  of  a  smooth  Gaussian  envelope,  both  in  the  Fou¬ 
rier  transform  plane  and  at  the  output  plane.  As  the 
value  of  K  increases,  the  generated  wave  in  the  Fourier 
plane  exhibits  a  nonlinear  amplitude  increase  across  its 
spatial  distribution.  The  increase  rate  is  reduced  near 
the  center  of  the  Gaussian  mode  and  approaches  its  maxi¬ 
mal  value,  in  accordance  with  a  sine  function  behavior. 
Consequently,  owing  to  the  Fourier  transform  relation¬ 
ship,  the  corresponding  pulse  image  at  the  output  be¬ 
comes  narrower.  This  image-narrowing  feature  signifies 
an  improvement  in  the  resolution  capability  of  the  time- 
to-space  converter,  as  the  total  time  window  did  not 
change.  For  greater  values  of  K  a  dip  in  the  generated 
wave  is  developed  due  to  a  downconversion  process  of 


photons  that  have  been  initially  generated  in  a  frequency- 
sum  process.  For  large  values  of  K,  complete  modulation 
of  the  Gaussian  spatial  mode  occurs,  resulting  in  two 
separate  Gaussian  diffraction  spots.  The  phenomenon  of 
a  nonuniform  signal  depletion  across  the  transverse  spa¬ 
tial  profile  is  also  observed  when  focusing  beams  for 
second-harmonic  generation  and  optical  parametric  am¬ 
plification  experiments.13-15  Figure  4  shows  a  compari¬ 
son  of  the  output  image’s  spatial  distribution  by  superim¬ 
posing  the  normalized  traces  for  small  K  (linear 
conversion)  and  K  =  2.5  (nonlinear  conversion).  The 
image-narrowing  feature,  resulting  in  enhanced  resolu¬ 
tion,  is  clearly  evident. 

Two  criteria  were  used  to  compare  the  output  images: 
A  FWHM  measure  and  a  normalized  second  moment  of 
the  profile,  i.e., 

JVj  out(^)ch  j  ^  Ioui(x)dx. 

A  plot  of  the  two  width  criteria  as  a  function  of  K  relative 
to  the  linear  conversion  case  shows  that  an  optimal  value 
exists  for  achieving  the  best  resolution  (see  Fig.  5).  K 
values  exceeding  the  optimal  value  result  in  sidelobes  in 


Generated  wave  profile  Pulse  image 

Fig.  3.  Field  distribution  of  the  converted  wave  in  the  Fourier 
plane  (left  column)  and  the  intensity  distribution  in  the  output 
plane  after  a  spatial  Fourier  transform  (right  column)  as  a  func¬ 
tion  of  the  reference  pulse  power  (parameterized  by  K). 
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Fig.  4.  Enhanced-resolution  imaging:  comparison  of  the  out¬ 
put  image  for  linear  (dashed  curve)  and  nonlinear  (solid  curve  for 
K  =  2.5)  conversions. 


K-factor 


Fig.  5.  Resolution  improvement  of  the  output  signal  as  a  func¬ 
tion  of  reference  pulse  power  K.  Dashed  curve,  FWHM  criteria; 
solid  curve,  standard  deviation  measure. 


o.i  i  io 

K  -  reference  pulse  energy  factor 


Fig.  6.  Conversion  efficiency  as  a  function  of  reference  pulse 
power  K.  The  linear  conversion  is  shown  to  provide  acceptable 
results  for  K  <  0.5. 

the  output  image  due  to  modulation  in  the  Fourier  plane, 
thereby  reducing  the  resolution.  The  FWHM  measure 
exhibits  an  abrupt  increase  when  the  intensity  of  the 
slidelobes  exceeds  the  half-maximum  value,  whereas  the 
second  moment  varies  smoothly. 

The  conversion  efficiency  was  calculated  with  Eq.  (27) 
and  tQ  =  0  as  a  function  of  K  (see  Fig.  6).  The  efficiency 
in  the  linear  case  [when  sinOc)  =  x  is  used]  is  plotted  as 
well  and  follows  a  20-dB/dec  slope.  We  can  conclude  that 


the  linear  case  is  applicable  for  K  <  0.5.  The  maximum 
conversion  efficiency  is  approximately  1.80,  or  90%  of  the 
photons  in  the  input  signal  pulse.  For  large  values  of  K 
the  conversion  efficiency  oscillates  about  1;  yet  the  corre¬ 
sponding  pulse  images  consist  of  two  diffracted  signals 
and  are  of  no  interest  for  time-to-space  conversion. 

The  new  result  predicting  enhanced-resolution  time-to- 
space  mapping  was  observed  experimentally  and  is  de¬ 
scribed  next. 


5.  ENHANCED-RESOLUTION  EXPERIMENT 

To  demonstrate  the  enhanced-resolution  time-to-space 
conversion,  we  operated  the  pulse  imager  in  the  nonlinear 
regime.  We  used  a  high-power  ultrashort  pulse  laser 
producing  1-mJ  pulses  of  100-fs  duration  at  an  800-nm 
center  wavelength.  The  pulses  were  divided  by  a  beam 
splitter  to  create  a  signal  and  a  reference  pulse  (5%  and 
95%  of  the  power,  respectively).  The  SDW’s  of  the 
pulses,  generated  by  600-line/mm  gratings  and  a  375-mm 
lens,  were  mixed  in  a  2-mm-long  j3  barium  borate  (BBO) 
crystal  in  a  type  I  phase -matching  configuration.  The 
reference  pulse  power  was  adjusted  by  a  neutral-density 
(ND)  filter  wheel  for  experimentation  with  different  val¬ 
ues  of  the  parameter  K.  (Large  ND  values  correspond  to 
small  values  of  K.)  The  intensity  distribution  of  the  gen¬ 
erated  wave  in  the  Fourier  plane  (as  opposed  to  the  am¬ 
plitude  in  the  theoretical  plots)  and  their  corresponding 
output  image  intensity  profile  were  detected  as  a  function 
of  the  ND  setting  (see  Fig.  7).  As  predicted  in  the  corn- 


spatial  frequency  plane  [a.u.] 


Time  [fs] 

Fig.  7.  Experimental  results:  (a)  intensity  distribution  in  the 
Fourier  plane,  (b)  output  pulse  image,  as  functions  of  the  refer¬ 
ence  pulse  attenuation.  A  high  ND  setting  corresponds  to  a 
small  K  value.  Broadening  in  the  Fourier  plane  and  narrowing 
in  the  corresponding  pulse  images  are  exhibited. 
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Normalized  square  root  of  reference  pulse  power 

Fig.  8.  Measured  conversion  efficiency  as  a  function  of  the 
square  root  of  the  reference  pulse  power.  Conversion  follows  the 
linear  slope  of  20  dB/ dec. 

puter  simulation  program,  at  a  low  ND  setting  the  gener¬ 
ated  wave  in  the  Fourier  plane  is  broader  and  its  corre¬ 
sponding  output  image  is  narrower.  A  plot  of  the 
conversion  efficiency  versus  the  square  root  of  the  refer¬ 
ence  pulse  power  shows  that  we  are  still  operating  in  the 
vicinity  of  the  linear  conversion  zone  (see  Fig.  8),  as  no 
saturation  effects  are  observed.  This  can  be  attributed  to 
using  a  Gaussian  spatial  mode,  which  varies  in  time, 
causing  the  nonlinear  conversion  to  be  less  pronounced 
than  in  the  rectangular  window  used  in  the  nonlinear 
conversion  analysis. 


realized  experimentally,  demonstrating  the  expected 
enhanced-resolution  performance. 

The  three-wave-mixing  process  was  analyzed  with  the 
assumption  of  a  bulk  nonlinear  crystal.  However,  it 
should  be  possible  to  achieve  other  functionalities,  such 
as  further  increasing  the  resolution  enhancement,  by  lat¬ 
eral  patterning  of  the  nonlinear  ciystal.  This  has  been 
demonstrated  in  quasi-phased-matched  periodically  poled 
LiNb03,  for  generation  of  a  flat-top  field.16  The  same 
principle  can  be  applied  to  the  time-to-space  processor, 
which  may  achieve  an  even  finer  spot  size  after  a  SFT. 


APPENDIX  A:  PULSE  AMPLITUDE  AND 
ENERGY  RELATIONSHIP 

The  SPD  resolution  was  shown  to  depend  on  the  input  pu¬ 
pil  size  on  the  diffraction  grating  with  the  increasing  reso¬ 
lution  and  time  window  at  a  larger  pupil.  For  the  pur¬ 
pose  of  the  time-to-space  converter,  the  nonlinear  wave- 
mixing  efficiency  depends  on  the  instantaneous  input 
powers.  When  the  mode  on  the  grating  is  increased  by 
use  of  two  lenses  in  a  telescope  configuration,  a  larger 
spatial  mode  can  be  supported,  but  the  peak  power  is  re¬ 
duced  as  the  pulse  carries  finite  energy.  In  this  Appen¬ 
dix  we  find  the  relationship  between  the  pulse  energy  and 
peak  power  by  using  our  Gaussian  pulse  and  Gaussian 
spatial  mode  model. 

Using  the  expression  for  the  input  field  on  the  diffrac¬ 
tion  grating  of  the  SPD,  Eq.  (4),  we  find  the  instantaneous 
intensity  distribution,  given  by 


6.  CONCLUSION 

The  operational  characteristics  of  a  time-to-space  proces¬ 
sor  based  on  three  wave  mixing  for  instantaneous  imag¬ 
ing  of  ultrafast  waveforms  has  been  analyzed.  A  model 
based  on  a  Gaussian  pulse  envelope  and  a  Gaussian  spa¬ 
tial  mode  has  been  developed.  This  model  closely  de¬ 
scribes  most  cases  of  interest,  where  ultrashort  pulse 
packets  from  a  possible  optical  data  communication  sys¬ 
tem  are  to  be  detected.  Using  the  properties  of  the 
Gaussian  model  and  the  SPD,  we  define  the  resolution  pa¬ 
rameter  of  the  processor.  The  resolution  parameter  dic¬ 
tates  the  observation  time  window  for  our  processor  and 
also  plays  a  vital  role  in  the  wave-mixing  process. 

Both  the  linear  and  the  nonlinear  conversion  regimes 
were  investigated,  yielding  the  output  spatial  signal  form 
and  conversion  efficiency.  In  the  linear  conversion  re¬ 
gime  we  find  that  we  can  define  an  efficiency-resolution 
product,  which  is  an  invariant  measure  depending  on  the 
system  parameters.  Thus  efficiency  can  be  traded  for 
resolution  and  vice  versa.  In  the  nonlinear  regime  we 
show  that,  by  using  a  strong  reference  pulse,  we  can  en¬ 
hance  the  spatial  bandwidth  of  the  signal,  thereby  achiev¬ 
ing  enhanced  temporal-resolution  imaging.  Numerical 
simulation  results  are  used  to  demonstrate  this  effect. 
The  maximum  possible  power  conversion  efficiency  was 
found  to  approach  180%.  The  numerical  data  also  illus¬ 
trate  the  operation  transition  point  between  the  linear 
and  the  nonlinear  regimes,  which  is  important  in  the  de¬ 
sign  of  this  processor.  The  nonlinear  operation  was  also 
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Using  the  expression  for  the  instantaneous  intensity  dis¬ 
tribution,  we  find  the  total  energy  on  the  grating,  i.e., 


^in (x,y>  i0d*dyd£  b  £s  , 


and  by  setting  the  expression  equal  to  the  energy  of  the 
short  pulse  £s ,  we  may  express  the  field  amplitude  as  a 
function  of  the  pulse  energy  and  pupil  size,  yielding 

■  <A2> 

As  expected,  when  the  spatial  extent  increases  or  the 
pulse  duration  increases  for  fixed  pulse  energy,  the  in¬ 
stantaneous  field  amplitude  decreases. 
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APPENDIX  B:  VALIDITY  OF  THE  QUASI- 
STATIC  APPROXIMATION  FOR  WAVE 
MIXING  WITH  HIGH-RESOLUTION 
SPECTRALLY  DECOMPOSED  WAVES 

The  effect  of  group  velocity  mismatch  gives  rise  to  tempo¬ 
ral  walk-off  when  we  are  wave  mixing  ultrashort  pulses. 
Since  the  time-to-space  processor  performs  the  wave¬ 
mixing  operation  on  the  SDWs  of  the  pulses,  the  walk-off 
effect  is  greatly  reduced  since  the  bandwidth  at  every 
point  Ao>  is  smaller  than  the  total  bandwidth  and  the  du¬ 
ration  is  extended  [see  Eqs.  (21)  and  (22)].  When  the 
Gaussian  approximations  for  the  pulse  envelope  and  spa¬ 
tial  mode  (Section  2)  are  used,  the  bandwidths  is  UN  of 
the  total  bandwidth. 

Since  in  our  assumptions  the  signal  and  the  reference 
beams  have  the  same  spatial  mode  size  Lx ,  they  will  also 
have  the  same  bandwidth  at  any  point.  The  phase  mis¬ 
match  at  any  location  u  is  caused  by  the  group  velocity 
mismatch  and  is  bound  by  fiAo)  [see  Eq.  (24)],  where  p  is 
the  inverse  group  velocity  mismatch  and  Aoj  can  be  re¬ 
duced  by  increasing  Lx .  It  is  possible  to  define  a  quasi¬ 
static  interaction  length,  Lqs  =  l/(/3A<o ),  which  charac¬ 
terizes  an  effective  length  from  which  the  effect  of  group 
velocity  mismatch  is  perceptible.  If  the  interaction 
length  in  the  nonlinear  ciystal  is  less  than  the  quasi¬ 
static  interaction  length,  i.e.,  Lc  <  Lqs ,  then  we  can  ne¬ 
glect  the  effect  of  group  velocity  mismatch.  Thus  the 
nonlinear  wave-mixing  problem  can  be  treated  in  the 
quasi-static  approximation  without  a  partial  derivative 
with  respect  to  time  in  the  differential  equations  charac¬ 
terizing  the  wave-propagation  phenomena. 

With  the  assumption  of  a  high-resolution  SDW,  Lqs  can 
be  made  very  large,  and  we  can  use  the  known  quasi¬ 
static  solutions  obtained  for  the  case  of  monochromatic 
waves. 
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Abstract  The  instantaneous  response  time  of  parametric  op¬ 
tical  nonlinearities  enable  real-time  processing  of,  and  interaction 
between,  spatial  and  temporal  optical  waveforms.  We  review  the 
various  signal-processing  alternatives  based  on  three-  and  four- 
wave-mixing  arrangements  among  spatial  and  temporal  informa¬ 
tion  carrying  waveforms.  The  fast  response  time  of  the  interac¬ 
tion  permits  information  exchange  between  the  time  and  space  do¬ 
mains,  providing  the  ability  to  correlate  and  convolve  signals  from 
the  two  domains.  We  demonstrate  the  usefulness  of  real-time  signal 
processing  with  optical  nonlinearities  with  the  following  experi¬ 
ments:  converting  waveforms  from  the  time  to  space  domain  as 
well  as  from  the  space  to  time  domain,  spectral  phase  conjugation 
and  spectral  inversion  of  ultrafast  waveforms,  transmission  of  the 
spatial  correlation  function  on  an  ultrafast  waveform,  and  a  sug¬ 
gestion  for  a  single-shot  triple  autocorrelation  measurement. 

Index  Terms— Femtosecond  pulse  shaping,  optical  information 
processing,  optical  signal  processing,  space-to-time  conversion, 
time  reversal,  time-to-space  conversion,  ultrafast  processes. 


I.  Introduction 

HARNESSING  ultrashort  laser  pulse  technology  for  fu¬ 
ture  high-capacity  optical  communication  systems  may 
result  in  new  paradigms  for  information  transmission.  Ultra- 
short  pulses  can  be  used  as  a  basis  for  time-division  multiplexing 
in  a  fiber  communication  application,  with  the  potential  to  carry 
ultra-high-speed  data  [1],  [2],  The  broad  spectrum  of  ultrashort 
pulses  can  be  utilized  as  a  low-noise  multiple-channel  optical 
source  for  distribution  of  data  using  wavelength-division  mul¬ 
tiplexing  by  implementing  spectral  slicing  techniques  in  either 
the  time  [3]  or  the  space  [4]  domain.  Data  networking  applica¬ 
tions  may  benefit  from  the  asynchronous  property  of  code-di¬ 
vision  multiple  access  (CDMA),  which  can  be  performed  by 
spectrally  encoding  and  decoding  ultrashort  pulses  with  unique 
codes  assigned  to  all  users  of  the  network  [5],  [6],  More  sophis¬ 
ticated  data  modulation  formats,  such  as  ultrafast  pulse  packet 
transmission  on  designated  time  slots  [7]  and  hybrid  pulse  posi¬ 
tion  modulation  with  CDMA  encoding  [8],  can  result  in  perfor- 
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mance  gains  due  to  efficient  utilization  of  the  ultrashort  pulse 
bandwidth.  The  ultrafast  modulation  and  detection  processes 
that  these  techniques  require  cannot  be  accomplished  by  con¬ 
ventional  electronic  means,  due  to  the  limited  response  time  of 
electronic  components,  and  require  novel  real-time  optical  pro¬ 
cessors  based  on  instantaneous  nonlinear  phenomena. 

During  the  last  .few  years,  we  have  developed  real-time 
optical  processors  utilizing  nonlinear  wave  mixing  of  two 
or  three  input  waves  originating  from  spatial  or  temporal 
channels  for  synthesizing,  processing,  and  detecting  ultrafast 
waveforms.  These  devices  perform  real-time  optical  signal 
processing  that  can  be  applied  to  data  conversion  between 
slow  parallel  channels  and  an  ultrafast  serial  signal.  For  the 
data  modulation  application,  we  demonstrated  a  space-to-time 
mapping  processor  that  converts  spatial  frequency  information 
from  an  input  spatial  channel  to  the  temporal  frequency  content 
of  an  input  ultrashort  pulse  [9],  [10].  Using  a  four-wave-mixing 
arrangement  utilizing  cascaded  second-order  nonlinearities,  the 
output  channel  yields  an  ultrafast  waveform  that  is  a  time-scaled 
replica  of  the  input  spatial  image.  For  detection  of  ultrafast 
waveforms,  we  built  a  time-to-space  mapping  processor,  which 
mixes  two  spatially  inverted  temporal  frequency  information 
waves  in  a  three- wave-mixing  arrangement,  generating  a  spatial 
signal  carrying  the  temporal  image  for  detection  by  slower  elec¬ 
tronic  means  [11],  [12].  The  two  input  temporal  channels,  the 
first  carrying  the  desired  ultrafast  information  and  the  second  a 
reference  ultrashort  pulse,  yield  a  quasi-monochromatic  wave 
due  to  the  mixing  process  at  every  point  in  space  between  two 
spectral  components  that  result  in  a  constant  carrier  frequency 
(due  to  the  mutually  inverted  spectra  arrangement  in  the  space 
domain).  By  introducing  only  temporal  information  channels 
to  our  processor,  we  have  demonstrated  real-time  optical  pro¬ 
cessing  of  temporal  waveforms.  We  performed  time-reversal 
experiments  using  a  four-wave-mixing  arrangement  with 
information  carrying  ultrafast  waveform  and  two  reference 
ultrashort  pulses  [13].  The  time-reversal  experiments  were 
based  on  performing  spectral  phase  conjugation  and  spectral 
inversion  operations,  achieving  time  reversal  of  the  electrical 
field  and  of  the  complex  amplitude  waveform,  respectively. 
Spectral  phase  conjugation  is  an  important  feature  for  compen¬ 
sation  of  chromatic  dispersion  and  some  nonlinear  effects  of  an 
optical  fiber  communication  link  [14]. 

In  this  paper,  we  explore  the  different  real-time  signal-pro¬ 
cessing  capabilities  that  are  possible  with  three-  and 
four-wave-mixing  arrangements  of  spatial  and  temporal 
input  waveforms.  We  adopt  the  methodology  for  invention 
introduced  by  Lohmann  [15]  to  unravel  all  wave-mixing 
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Fig.  1.  Input  optical  waves  for  consideration  in  Fourier  plane  processing, 
(a)  Spectrally  decomposed  wave,  (b)  spatial  frequency  information,  and  (c) 
ultrashort  pulse. 

combinations  of  the  input  optical  signals.  A  table  is  constructed 
for  each  interaction  type,  with  columns  for  the  various  input 
waveforms,  leading  to  a  unique  output  signal  at  each  row.  We 
find  that  this  generalized  approach  describes  not  only  the  ex¬ 
periments  we  have  reported  but  also  experiments  conducted  by 
others  in  the  past  and  some  new  results  that  are  described  herein. 
We  demonstrate  the  transmission  of  the  correlation  function  of 
spatial  images  on  an  ultrafast  waveform  and  describe  a  mea¬ 
surement  technique  of  ultrashort  pulses  that  provides  the  triple 
correlation  values,  from  which  the  precise  pulse  intensity  could 
be  extracted. 

This  paper  is  organized  as  follows.  Section  II  defines  the  input 
spatial  and  temporal  waves  that  we  consider  for  interaction  via 
the  optical  nonlinearities.  We  limit  the  scope  of  this  study  to  in¬ 
teraction  at  the  Fourier  plane  of  a  4-F  system,  where  spatial  and 
temporal  frequency  information  appears.  The  resulting  output 
waveform  combinations  in  the  cases  of  three-  and  four-wave 
mixing  are  analyzed  in  Sections  III  and  IV,  respectively.  New 
results  are  described  in  Sections  V  and  VI,  followed  by  con¬ 
cluding  remarks. 

II.  Spatial  and  Temporal  Input  Waves 

The  spatial  and  temporal  input  waves  interact  by  the  non¬ 
linear  mixing  process  at  the  Fourier  plane  of  a  4-F  processor. 
The  4-F  arrangement  has  traditionally  been  used  for  coherent 
processing  of  spatial  images  [16]  and  modified  for  processing 
of  ultrafast  waveforms  by  introducing  diffraction  gratings  at  the 
input  and  output  planes  [17].  The  input  waves  that  are  to  be  con¬ 
sidered  in  the  wave-mixing  processing  are  (see  Fig.  1):  1)  spec¬ 
trally  decomposed  wave  (SDW),  i.e.,  spatially  dispersed  tem¬ 


poral  frequency  information;  2)  monochromatic  wave  carrying 
spatial  frequency  information;  and  3)  ultrashort  pulse  with  no 
spatial  information.  We  assume  that  all  input  waves  have  the 
same  center  angular  frequency  wq.  Furthermore,  the  informa¬ 
tion  contained  in  the  y-axis  is  omitted,  as  we  consider  one-di¬ 
mensional  (1-D)  spatial  images  only  and  the  temporal  informa¬ 
tion  is  inherently  1-D.  We  configure  the  input  optical  signals 
with  a  lateral  displacement,  for  convenience  as  well  as  for  sat¬ 
isfaction  of  the  noncollinear  phase  matching  requirement.  This 
lateral  shift  introduces  a  constant  linear  phase  component  in  the 
Fourier  plane,  which  is  omitted  here  for  brevity. 

The  SDW  is  generated  by  diffracting  the  ultrafast  waveform 
signal  p(t)  from  the  input  plane  grating,  at  an  incidence  angle  6 
relative  to  the  grating  normal,  such  that  the  diffracted  wave  at  the 
center  frequency  of  the  ultrashort  pulse  is  propagating  parallel 
to  the  optical  axis.  The  field  at  the  back  focal  plane  of  the  lens, 
or  the  Fourier  plane,  is  described  by  [12] 

x  exp(-jct?oO  (la) 

where 

jP(o>)  temporal  Fourier  transform  of  p(t); 

tu(«)  beam  projection  profile  on  the  input  diffraction  grating 
(or  pupil  function); 

a  grating’s  angular  dispersion  parameter  (a  =  sin(0)); 

F  lens  focal  length; 

c  speed  of  light  in  vacuum. 

Equation  (la)  describes  a  wave  of  finite  duration  (limited  by  the 
pupil  function),  whose  transverse  profile  (along  x')  carries  the 
temporal  spectrum  information,  with  a  rotating  wave- vector  in 
time.  It  represents  the  limiting  case  of  a  high-resolution  SDW, 
applicable  when  the  duration  of  the  ultrafast  waveform  is  much 
shorter  than  the  travel  time  across  the  pupil  function.  In  cer¬ 
tain  signal-processing  applications,  it  is  desirable  to  disperse  the 
spectra  in  the  opposite  direction.  For  this  case,  we  introduce  the 
ultrafast  waveform  from  the  opposite  size  (at  angle  — 0 ),  uti¬ 
lizing  the  opposite  diffraction  order,  resulting  in  the  SDW  field 

=«>(£)*’  (-f5r)  «P  (~^*) 

x  exp(-jw0t)-  (lb) 

We  assume  that  the  pupil  functions  of  the  SDWs  of  ( 1  a)  and  (lb) 
are  equal  and  even  functions,  resulting  in  an  identical  duration 
and  temporal  variation  for  the  two  functions.  Further  inspection 
shows  that  the  temporal  frequency  information  is  reversed,  as  is 
the  wave- vector  rotation  direction. 

The  spatial  information  wave  is  generated  by  a  1-D  mask 
m(x)  placed  at  the  input  plane  of  the  lens  and  illuminated  by  a 
monochromatic  light  source.  At  the  back  focal  plane  of  the  lens, 
we  observe  the  spatial  frequency  information,  given  by  [16] 

f-'spatia =  M  exp (~ju>ot)  (2) 

where  M(fx)  is  the  spatial  Fourier  transform  of  the  image  and 
Aq  is  the  wavelength. 
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TABLE  I 

Ultrafast  Signal-Processing  Alternatives  With  Three-Wave  Mixing 


Input  U\ 

Input  Ui 

Resultant  £/3 

Output  signal 

Comments 

SDW 

SDW 

SDW  with  doubled 
spatial  dispersion 

Ultrafast  waveform: 

convolution  of  input 

waveforms 

Output  at  doubled  carrier 
center  frequency 

SDW 

Inverted 

SDW 

Quasi- 

monochromatic 

wave 

Spatial  signal:  correlation  of 
input  waveforms 

When  one  input  wave  is 
transform  limited  — >  time- 
to-space  conversion 

Spatial 

frequency 

wave 

SDW  I 

Spatial 

frequency 

wave 

Spatial  frequency 
wave 

Spatial  signal:  convolution 
of  input  images 

Spatial  | 

1  frequency 
wave 

SDW 

Ultrafast  waveform: 

correlation  of  input 

waveform  and  spatial  image 

Output  at  doubled  carrier 

1  center  frequency 

Ultrashort 

pulse 

Ultrashort 

pulse 

Ultrashort  pulse 

Image  of  output  wave: 

1  correlation  of  input 

waveforms 

Standard  intensity  cross¬ 
correlation  measurement 

Spatial 

frequency 

wave 

Ultrashort 

pulse 

Ultrashort  pulse 

N.A. 

Can  be  used  for  frequency 
up-conversion 

SDW 

Ultrashort 

pulse 

Ultrashort  pulse 

Image  of  output  wave: 
convolution  of  input 

ultrafast  waveforms 

Can  be  used  for  time-to- 
space  conversion 

Finally,  we  also  consider  a  wave  that  carries  the  temporal  in¬ 
formation  directly  (as  opposed  to  the  temporal  frequency  infor¬ 
mation)  with  no  spatial  dependence,  defined  by 

tfrempO^;  t)  =  p(t)  exp(-jv0t).  (3) 

When  this  signal  is  utilized  with  a  real-time  nonlinearity,  it  per¬ 
forms  a  time  gating  functionality. 

The  waves  defined  in  (1) — (3)  interact  using  the  parametric 
processes  of  a  nonlinear  crystal.  We  seek  to  identify  the 
signal-processing  capabilities  that  are  enabled  by  the  ability 
to  generate  the  product  of  the  waves.  For  this  purpose,  the 
wave-mixing  crystal  is  thin,  such  that  phase  mismatch  and 
walkoff  effects  are  not  considered.  Depending  on  the  combi¬ 
nation  of  input  waves,  the  output  signal  will  be  in  either  the 
time  domain  or  the  space  domain.  It  is  further  assumed  that 
all  waveforms  arrive  at  the  processor  at  the  same  time  (i.e., 
no  time  delay  between  waveforms),  unless  otherwise  noted. 
In  a  three-wave-mixing  arrangement,  the  product  of  the  two 
waves  is  produced  by  a  noncollinear  frequency-sum  process. 
The  output  wave  will  be  at  a  doubled  carrier  frequency.  In 
the  four-wave-mixing  arrangement,  the  output  is  at  the  same 
carrier  frequency  due  to  the  degenerate  configuration.  We 
implemented  the  four- wave  mixing  by  a  cascade  of  three-wave 
processes,  frequency-sum  followed  by  frequency-difference 
in  a  noncollinear  type-II  phase-matching  arrangement  [18].  In 
this  arrangement,  the  output  signal  is  copropagating  with  one 
of  the  input  signals,  but  at  an  orthogonal  polarization  state.  The 
output  signal  can  be  extracted  with  a  polarization  beam  splitter. 
The  three-wave-processing  capabilities  are  developed  next. 

III.  Ultrafast  Processing  With  Three-Wave  Mixing 

In  a  three-wave-mixing  process,  two  input  waves  generate  a 
third  output  wave  that  is  proportional  to  the  product  of  the  two 
waves,  i.e.,  U$  oc  x^UiU Each  of  the  input  fundamental 
waves  U\  and  U2  can  have  the  form  of  any  one  of  the  three 


waves  of  (1M3).  Table  I  summarizes  all  the  fundamental  wave 
combinations  and  the  resulting  processing  achieved  at  the  output 
signal. 

A,  SDW-SDW  Mixing 

The  wave  produced  by  the  product  of  two  input  fundamental 
waves  of  the  form  of  (la)  is  given  by 


Applying  a  spatial  Fourier  transform  to  the  output  plane,  with 
the  Fourier  kernel  adjusted  for  the  new  carrier  frequency,  yields 

CW* f;t)  <xy  ^2  - — ^  exp(—j2u>0t) 

(5) 

where  y(t)  =  pi(t)  ®  p2(t)  and  ®  denotes  the  convolution  op¬ 
erator.  The  new  ultrafast  waveform  y(+)  scans  along  the  output 
plane  at  a  velocity  of  c/a.  If  the  signal  of  (5)  is  diffracted  from  a 
grating  with  a  spatial  frequency  that  is  double  that  of  the  grating 
in  the  input  plane  of  the  processor,  then  the  waveform  will  prop¬ 
agate  in  free  space.  The  doubled  spatial  frequency  is  required 
since  the  center  wavelength  has  been  halved.  The  new  waveform 
is  proportional  to  the  convolution  of  the  two  input  waveforms, 
resulting  from  the  information  exchange  among  the  temporal 
frequency  components  of  px{t)y  p2(t ),  and  y(t). 

B.  SDW-lnverted  SDW  Mixing 

The  wave  produced  by  the  product  of  two  mutually  inverted 
SDW  is  given  by 

***«  «(-!)«■(*)*(£)*(•  -£) 

X  exp(-j2u)0t).  (6) 
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Fig.  2.  Experimental  setup  for  time-to-space  conversion  using  mutually 
inverted  spectrally  decomposed  waves  with  three-wave  mixing.  Inset: 
experimental  result  showing  converted  image  from  an  ultrafast  pulse  sequence. 
Temporal  information  can  be  inferred  from  image  with  knowledge  of  scaling 
factor. 

Applying  a  spatial  Fourier  transform,  with  the  Fourier  kernel 
adjusted  for  the  new  carrier  frequency,  yields 

Uout(x";t)  oc  y  w  (“)  w  (/f)  exp(-j2u>0t) 

(7) 

where  y(t)  =  pi(t)  ©  P2{~t).  The  stationary  spatial  signal, 
y(*),  is  proportional  to  the  convolution  of  the  two  input 
waveforms,  where  one  of  them  is  reversed  in  time.  This 
mixing  process  was  developed  for  time-to-space  conversion 
by  using  a  reference  transform-limited  pulse  for  the  reversed 
waveform  [11],  as  illustrated  in  Fig.  2.  We  have  performed 
such  time-to-space  conversions  with  ultrashort  pulses  varying 
in  energy  levels  from  subnanojoules  (for  compatibility  with  op¬ 
tical  communication  applications  [19])  to  millijoules  with  both 
LBO  and  /3-barium  borate  (BBO)  crystals,  with  information 
conversion  efficiencies  as  high  as  120%  [12]  (the  conversion 
efficiency  can  exceed  100%  due  to  an  equal  energy  contribution 
from  the  reference  pulse).  The  high  conversion  efficiency  is 
possible  due  to  the  favorable  phase  matching  that  the  mutually 
inverted  SDWs  support  [20]. 

C.  Mixing  Spatial  Information  Channels 

The  wave  produced  by  the  product  of  two  spatial  frequency 
information  channels  is  given  by 

Uz{x':t)  =  Mi  Mo  (f~pj  exp(-j2w0t).  (8) 

Applying  a  spatial  Fourier  transform  yields 

Uout(x”\  t)  =  y(- 2xn)  exp(-j2u;ot)  (9) 

where  y{x)  =  mi(x)  ©  rri2(x).  The  output  spatial  signal  is  a 
convolution  of  the  input  images  and  is  a  typical  output  signal  in 
Fourier  optics  [16]  apart  from  the  scaling  factor  of  two,  caused 
by  the  frequency  doubling.  The  same  signal-processing  func¬ 
tionality  can  be  achieved  with  photorefractive  recording  media, 
as  the  fast  response  time  of  the  parametric  processes  is  not  re¬ 
quired  here. 


D.  SDW-Spatial  Wave  Mixing 

The  wave  produced  by  the  product  of  a  SDW  of  the  form  of 
(la)  and  a  spatial  frequency  information  wave  of  the  form  of  (2) 
is  given  by 

=»  (-£ )  ft  (]£?) 

x  exp  t'j  exp(—j2u)0t).  (10) 

Applying  a  spatial  Fourier  transform  to  the  output  plane,  with 
the  Fourier  kernel  adjusted  for  the  new  carrier  frequency,  yields 

Uout(x t)ocyft-  w  (ffj  exp(-;2wo<)  (11) 

where  y(t)  =  pi(t)  ®  m2(ct/a).  The  new  ultrafast  waveform 
$/(•)  scans  along  the  output  plane  at  a  velocity  of  c/2a.  Since 
both  the  scanning  velocity  at  the  output  and  the  center  wave¬ 
length  have  been  halved,  it  is  necessary  to  place  a  grating  with 
a  spatial  frequency  that  is  quadruple  that  of  the  grating  in  the 
input  plane  of  the  processor  for  the  waveform  to  propagate  in 
free  space.  The  new  waveform  is  proportional  to  the  convolu¬ 
tion  of  the  input  ultrafast  waveform  and  the  spatial  image,  when 
scaled  by  c/a  to  a  time-domain  variable.  Mixing  the  temporal 
frequency  content  of  an  ultrashort  pulse  pi  ( t )  with  the  spatial 
frequency  information  of  an  image  m2  (#)  can  therefore  be  used 
to  perform  pulse  shaping  for  the  output  waveform  y(t),  albeit  at 
a  converted  wavelength. 

E.  Mixing  Ultrashort  Pulses 

The  wave  produced  by  the  product  of  two  ultrashort  pulses  of 
the  form  of  (3),  while  allowing  for  a  timing  difference  r  between 
the  pulses,  is  given  by 

lh{xf:t)  =  pi(t)p2{t  +  r)  exp(— j2cc>of).  (12) 

Suppose  next  that  we  place  a  slow  detector  at  the  output  of  the 
crystal  to  measure  the  intensity  of  the  generated  second  har¬ 
monic  light.  The  instantaneous  intensity  that  is  incident  on  the 
detector  is 

Iz{x';t)  =  \U3{x':t)\2  =  \pi{t)p2{t  +  t)\2  =  h{t  )I2(t  +  T). 

(13) 

However,  due  to  the  slow  response  time  of  detector,  the  mea¬ 
surement  registers  the  accumulated  signal,  given  by 

h(r)=  J  h(t)I2(t  +  r)dt.  (14) 

The  output  signal  is  the  well-known  intensity  cross-correlation 
of  the  ultrafast  waveforms,  the  most  fundamental  short  pulse 
characterization  technique  [21],  [22].  Note  that  the  intensity 
cross-correlation  signal  is  observable  in  space  in  a  noncollinear 
arrangement  (by  imaging  the  output  light),  but  our  simplified 
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TABLE  H 

Ultrafast  Signal-Processing  Alternatives  With  Four-Wave  Mixing 


Input  U] 

Input  U2 

Input 

Resultant 

U4 

Output  signal 

Comments 

SDW 

SDW 

U 

Ultrafast  waveform: 

convolution  and  correlation 
among  input  waveforms. 

Spectral  phase 

conjugation  for  dispersion 
compensation 

SDW 

Inverted 

SDW 

Inverted 

SDW 

SDW 

Ultrafast  waveform: 

convolutions  among 

waveform,  inverted  waveform 
and  conjugated  waveform 

True  time-inversion  for 
complex  amplitude 

waveforms. 

SDW 

Spatial 

frequency 

wave 

Spatial 

frequency 

wave 

SDW 

Ultrafast  waveform: 

convolution  and  correlation 
among  input  waveform  and 
spatial  images 

Space-to-time  conversion 
when  one  spatial  channel 
is  delta  function 

Spatial 

frequency 

wave 

SDW 

SDW 

Spatial 

frequency 

wave 

Spatial  wave:  correlation  of 
temporal  waveforms 

convolved  to  spatial  wave 

Time-to-space  conversion 
with  dispersion 

compensation 

SDW 

Inverted 

SDW 

Spatial 

frequency 

wave 

Spatial 

frequency 

wave 

Spatial  wave:  convolution  of 
temporal  waveforms 

correlated  to  spatial  wave 

Spatial 

frequency 

wave 

Spatial 

frequency 

wave 

Spatial 

frequency 

wave 

Spatial 

frequency 

wave 

Spatial  wave:  convolution  and 
correlation  among  spatial 
images 

SDW 

Ultrashort 

pulse 

Ultrashort 

pulse 

Ultrashort 

pulse 

. 

Spatial  image:  Intensity  triple 
cross-correlation 

Exact  waveform  can  be 
extracted  from  triple 
correlation 

representation  of  (3)  does  not  take  into  account  the  space-time 
dependence  of  the  propagating  waveform. 

F.  Mixing  Spatial  Waves  With  Ultrashort  Pulses 

The  wave  produced  by  the  product  of  a  spatial  frequency  in¬ 
formation  wave  and  an  ultrashort  pulse  is  given  by 

U3(x';  t )  =  Mi  j  p2(t)  exp(-j2wo t).  (15) 

This  signal  has  no  significant  signal-processing  application  that 
comes  to  mind. 

G.  SDW-Ultrashort  Pulse  Mixing 

Mixing  an  SDW  with  an  ultrashort  pulse  introduces  a  time 
gating  on  the  wave,  resulting  in  the  output  second  harmonic 
wave 


x  exp(-j[2wof).  (16) 


Applying  a  spatial  Fourier  transform  to  the  output  plane,  with 
the  Fourier  kernel  adjusted  for  the  new  carrier  frequency,  yields 

Uout(z";t)  ocpi  - - — ^  P2{t)w  exp(—  j2uj0t). 

(17) 

We  wish  to  record  the  output  spatial  signal  by  placing  a  sensing 
device  that  integrates  the  output  intensity,  such  as  a  charge-cou¬ 
pled  device  (CCD)  camera.  The  observed  image  is  given  by 

4 ut(*")  =  J \Uonl(x"-t)\2dtK  J Ix(t-^pjl2(t)dt 

(18) 


where  we  assumed  that  the  long  aperture  duration  is  negligible 
due  to  the  short  duration  of  the  ultrafast  waveforms  (consis¬ 
tent  with  our  assumption  of  a  high-resolution  SDW).  The  image 
corresponds  to  the  intensity  cross-correlation  of  the  two  ultra¬ 
fast  waveforms,  converted  to  the  space  domain  and  permitting 
single-shot  measurement.  This  approach  was  used  for  one  of 
the  earliest  experiments  demonstrating  time-to-space  conver¬ 
sion,  albeit  with  a  slower  excitonic  nonlinear  process  in  a  ZnSe 
film  [23].  It  is  also  possible  to  perform  the  time-to-space  con¬ 
version  by  placing  the  nonlinear  crystal  at  the  image  plane  of  the 
grating  instead  of  at  the  Fourier  plane,  as  recently  demonstrated 
[24]. 

IV.  Ultrafast  Processing  With  Four- Wave  Mixing 

In  a  degenerate  four-wave-mixing  process,  three  input  waves 
at  the  same  carrier  frequency  generate  a  fourth  output  wave  that 
is  proportional  to  U4  oc  x^Ui  •  The  output  wave  is  also  at 
the  same  carrier  frequency  as  the  input  waves.  Each  of  the  input 
fundamental  waves  U± ,  U2  >  and  U3  can  have  the  form  of  any  one 
of  the  three  waves  defined  by  (1M3).  Table  II  summarizes  the 
input  wave  combinations  of  interest  and  the  resulting  processing 
achieved  at  the  output  signal. 

A.  SDW  Mixing 

The  wave  produced  by  the  three  input  fundamental  waves  of 
the  form  of  (la)  is  given  by 

(  .  \ 

x  exp  V  ~aF  f )  exP(~^ot)- 


(19) 
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Fig.  3.  Experimental  setup  for  time  reversal  using  spectral  phase  conjugation 
with  four-wave  mixing.  Inset:  experimental  result  showing  time  reversal  of 
pulses  A  and  B  and  chirp  flipping  (dashed  line  corresponds  to  simulated  phase 
function). 


Fig.  4.  Experimental  setup  for  time  reversal  using  spectral  inversion  with 
four-wave  mixing.  Inset:  experimental  result  showing  time  reversal  of  pulses 
A  and  B  and  preservation  of  chirp  sign  (dashed  line  corresponds  to  simulated 
phase  function). 


Applying  a  spatial  Fourier  transform  yields 


Applying  a  spatial  Fourier  transform  yields 

*4ut  (*";*)  OC  y(t-  u’3  exp(-juot)  (20) 

where  y(t)  =  pi(t)  ®p2(t)  ®  p^-t)  =  p±(t)  ®p2(t)  *Pz{t) 

and  *  denotes  the  correlation  operation.  The  new  ultrafast  wave¬ 
form  ?/(•)  scans  along  the  output  plane  at  a  velocity  of — c/a  and 
can  be  coupled  to  ffee-space  propagation  by  diffraction  from  a 
grating  identical  to  the  grating  at  the  input  plane  of  the  processor. 
The  new  waveform  is  proportional  to  the  convolution  of  the  first 
and  second  input  waveforms  with  the  third  waveform,  which  is 
time-reversed  and  conjugated.  A  convolution  with  the  time-re¬ 
versed  and  conjugated  signal  is  equivalent  to  a  correlation  oper¬ 
ation.  Thus,  the  output  waveform  is  a  combination  of  a  convo¬ 
lution  and  correlation  among  the  input  waveforms.  If  Pi(t)  and 
p2(t)  are  both  transform-limited  pulses,  the  output  spectra  is  a 
phase  conjugate  of  the  spectra  of  pz{t),  which  is  an  important 
application  for  dispersion  compensation  in  optical  fiber  com¬ 
munication  links.  We  have  performed  the  spectral  phase  conju¬ 
gation  experiment  (see  Fig.  3)  with  a  signal  wave  consisting  of  a 
pulse  pair:  a  chirped  pulse  followed  by  a  transform  limited  pulse 
[13].  As  expected,  the  two  pulses  exchanged  location,  while  the 
chirped  pulse  also  reversed  its  chirp  sign.  All  pulses  were  ob¬ 
tained  from  a  millijoule  source  pulse  and  processed  in  a  BBO 
crystal  with  cascaded  second-order  nonlinearities. 

B.  SDW  and  Inverted  SDW  Mixing 

There  are  several  options  for  choosing  which  channels  to  in¬ 
troduce  to  the  processor  from  the  opposite  side,  such  that  their 
corresponding  SDW  will  be  inverted.  When  the  SDWs  of  waves 
Un  and  U3  are  inverted,  the  wave  produced  by  the  three  input 
fundamental  waves  is  given  by 


x  p;  (~Ff)  exp  (j  FF  *)  exp(“^’0°- 

(21) 


Uoul(x":t)  oc  y(t-  w  )  exp(-ju>0t) 

V  C  °  (22) 

■where  y(t)  =  pi(t)®p2(-t)®pl{t)  =  Pi(t)<S>p2(-t)*p3(-t). 

The  unique  feature  of  this  arrangement  is  that  the  output  wave¬ 
form  is  proportional  to  the  time-reversed  complex  amplitude 
waveform  of  the  second  channel.  We  have  demonstrated  this 
property  for  the  first  time  [13],  to  the  best  of  our  knowledge,  by 
using  transform-limited  pulses  for  pi(£)  and  p2(t).  The  signal 
wave  consisted  of  a  pulse  pair:  a  chirped  pulse  followed  by  a 
transform- limited  pulse.  As  expected,  the  two  pulses  exchanged 
location,  while  the  chirped  pulse  preserved  its  chirp  sign  (see 
Fig.  4). 

C.  SDW  and  Spatial  Wave  Mixing  I 

The  wave  produced  by  the  interaction  of  an  SDW  with  two 
spatial  frequency  information  waves  is  given  by 


x  exp  fj  tj  exp(-ju)ot).  (23) 


Applying  a  spatial  Fourier  transform  yields 

Uou t(x";  t)  cx  y  (t  -  w  exp(-jw0f)  (24) 

where  y(t)  =  pi(£)  ©  rri2{ct/a)  ©  m$(—ct/a)  =  pi(t)  © 
rri2(ct/a)  *  m^(ct/a).  The  output  ultrafast  waveform  carries 
the  correlation  information  of  the  two  spatial  channels.  Such 
a  configuration  can  be  useful  for  performing  a  spatial  correla¬ 
tion  function  and  transmitting  the  correlation  information  to  a 
remote  site  on  a  temporal  channel.  The  spatial  correlation  fea¬ 
ture  was  demonstrated  and  is  discussed  in  Section  V.  When 
one  of  the  spatial  channels  is  a  delta  function,  the  result  is  a 
space-to-time  conversion  [9],  [10]  (see  Fig.  5).  All  the  input 
optical  signals  were  generated  from  a  single  millijoule  source 
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pulse;  the  pulse  was  stretched  with  a  grating  pair  for  imple¬ 
menting  the  two  monochromatic  spatial  channels. 

D.  and  Spatial  Wave  Mixing  11 

A  different  arrangement  of  the  input  interacting  waves  pro¬ 
duces  the  equivalent  of  real-time  time-to-space  conversion,  sim¬ 
ilar  to  that  achieved  with  spectral  holography  [25].  The  wave 
produced  by  the  interaction  of  a  spatial  wave  with  two  SDWs  is 
given  by 


x  exp(-ju)ot).  (25) 


Fig.  5.  Experimental  setup  for  space-to-time  conversion  with  four-wave 
mixing.  Inset:  experimental  result  showing  conversion  of  spatial  information 
(regularly  placed  slits)  to  an  ultrafast  waveform  consisting  of  a  pulse  sequence 
at  1.56-THz  repetition  rate. 


Applying  a  spatial  Fourier  transform  yields 

flout  fa";  t)  oc  y(~x")w2  exp(—ja/oO  (26) 

where  y(x)  =  mi(x)  ®p2(ax/c)  *pz(ax/c).  The  stationary 
output  signal  carries  the  information  of  the  correlation  of  the 
two  waveforms,  scaled  to  a  spatial  dependence,  convolved  with 
the  spatial  signal.  Such  a  configuration  can  be  useful  for  disper¬ 
sion  cancellation  when  two  waveforms  are  transmitted  along  the 
same  path  [7].  When  the  input  spatial  signal  carries  no  informa¬ 
tion  (delta  function),  the  output  signal  results  in  a  time-to-space 
conversion  by  four-wave  mixing,  achieving  the  same  result  as 
holographic  four-wave  mixing,  albeit  in  real-time. 

E.  SDW,  Inverted  SDW,  and  Spatial  Wave  Mixing 

It  is  possible  to  mix  SDW  and  get  a  spatial  output  by  wave 
mixing  an  SDW,  an  inverted  SDW,  and  a  spatial  frequency  in¬ 
formation  wave.  The  resulting  wave  is  given  by 

)•(!)*(#)*(-£) 

X  exp(-ju>0t).  (27) 


Applying  a  spatial  Fourier  transform  yields 

Uout(x t)  =  y(-x”)  exp(-ju;oO  (30) 

where  y(x)  =  m^x)  ®  m2(x)  ©  ml(x)  =  m^x)  ©  m2(x)  * 
ms(x).  The  convolution  and  correlation  relationship  among  the 
input  spatial  images  is  typical  in  holographic  signal  processing. 
Utilizing  nonlinear  wave  mixing  yields  the  same  results  in  real 
time. 

G.  SDW  and  Ultrashort  Pulse  Mixing 

A  time-delay  parameter  is  added  to  one  of  the  ultrashort 
pulses  for  describing  the  interaction  of  an  SDW  with  two 
ultrashort  pulses.  The  output  wave  is  therefore  given  by 


X  P2(t)ps(t  -  to)  exp(-jwot).  (31) 

Applying  a  spatial  Fourier  transform  yields 

U0ut{x"-.t)  =W  Pi  (*  ~  -—'j  P2(t)p*3(t  -  t0) 

x  exp(-ju>0t).  (32) 


Applying  a  spatial  Fourier  transform  yields 

f)exp(-^) 

(28) 

where  y(t)  =  p1{t)®p2{-t)®m%{-ct/a)  =  pi(t)®p2(-t)* 
m3(ct/a).  The  stationary  output  signal  carries  the  information 
of  the  convolution  of  the  two  waveforms  (where  one  is  reversed 
in  time)  and  a  correlation  with  the  spatial  signal.  Such  a  config¬ 
uration  can  be  useful  to  correlate  an  incoming  waveform px(t) 
with  a  spatial  image  and  requiring  a  spatial  output  for  subse¬ 
quent  detection  (by  using  a  featureless  reference  pulse  for  the 
second  temporal  channel). 


F  Spatial  Wave  Mixing 

Three  spatial  frequency  information  waves  interacting  via  de¬ 
generate  four-wave  mixing  generate  an  output  wave  given  by 


x  exp (~ju>0t). 


(29) 


The  observed  image  on  a  slow  detection  device  such  as  a  film 
or  camera  is  described  by 

SOut(x":t0) 

/OO 

\Uout(x";t)\2dt 

■OC 

~  u’2(0)  j  h  (t-  I2(t)h(t  -  t0)  dt  (33) 

where  we  used  the  approximation  that  the  field  strength  varia¬ 
tion  due  to  the  spatial  beam  mode  does  not  change  for  the  short 
duration  of  the  sampling  pulse.  The  image  contains  the  triple 
cross-correlation  information  of  the  three  input  ultrashort  wave¬ 
form  intensities.  The  triple  correlation  requires  two  time-lag  pa¬ 
rameters.  In  the  result  of  (33),  one  time-lag  parameter  is  mapped 
to  the  spatial  domain,  while  the  other  is  set  by  the  relative  time 
delay  between  the  second  and  third  ultrafast  waveforms.  If  all 
the  input  waveforms  to  the  processor  are  identical,  then  the  triple 
autocorrelation  data  can  be  used  to  extract  the  exact  intensity 
profile  of  the  input  short  pulse.  To  reconstruct  the  input  wave¬ 
form  intensity  profile,  the  complete  two-dimensional  (2-D)  cor- 
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Fig.  6.  Experimental  setup  for  the  information  exchange  among  two  input 
spatial  channels  and  an  input  ultrafast  waveform.  The  output  ultrafast  waveform 
contains  the  correlation  information  of  the  two  spatial  channels. 

relation  matrix  is  required  [26].  Using  this  arrangement  would 
require  a  sequence  of  measurements,  varying  the  parameter  fo 
for  each  measurement.  A  single-shot  variant  of  this  technique  is 
described  in  Section  VI. 

V.  Spatial  Correlation  Function  Carried  on  an 
Ultrafast  Waveform 

Mixing  spectrally  decomposed  waves  and  spatially  Fourier 
transformed  images  results  in  an  information  exchange  be¬ 
tween  the  temporal  spectra  of  the  ultrashort  pulse  and  the 
spatial  spectra  of  the  image.  The  information  exchange  results 
in  waveform  synthesis  by  a  space-to-time  conversion.  A 
three-wave-mixing  arrangement  for  the  information  exchange 
results  in  a  temporal  signal  with  a  doubled  center  frequency. 
When  the  synthesized  waveform  is  required  to  have  an  identical 
center  frequency,  a  degenerate  four-wave-mixing  arrangement 
is  utilized,  with  a  second  spatial  signal  consisting  of  a  (feature¬ 
less)  point  source.  In  this  section,  the  information  exchange 
between  the  two  spatial  images  and  the  ultrafast  waveform  is 
demonstrated  (see  Fig.  6). 

Mixing  an  SDW  and  two  spatially  Fourier  transformed  im¬ 
ages  using  a  degenerate  four-wave-mixing  process  results  in  an 
output  temporal  waveform  that  is  a  convolution  of  the  input  tem¬ 
poral  waveform  with  the  correlation  of  the  two  spatial  images, 
properly  scaled  to  a  time-domain  representation,  i.e.,  y(t)  — 
Pi(t)®m2(ct/a)*niz(ct/a).  In  the  experiments,  a  commercial 
laser  system  consisting  of  a  Ti :  Sapphire  ultrashort  pulse  oscil¬ 
lator  and  a  regenerative  amplifier  was  used.  The  system  gener¬ 
ates  ultrashort  laser  pulses  of  100  fs  duration  at  a  center  wave¬ 
length  of  800  nm  with  an  energy  level  of  1  mJ  per  pulse.  Ninety 
percent  of  the  emitted  output  pulse  power  was  utilized  for  the  in¬ 
tense  quasi-monochromatic  light  source  required  by  the  spatial 
channels  and  stretched  to  a  duration  of  several  picoseconds  by  a 
grating  pair.  The  stretched  pulse  was  split  into  two  beams  for  im¬ 
plementing  the  two  quasi-monochromatic  spatial  channels.  The 
remaining  10%  of  the  short  pulse  laser  output  power  was  used 
as  the  reference  ultrashort  pulse  in  the  input  temporal  channel. 
The  SDW  Ui  is  generated  by  a  600-lines/mm  blazed  grating  that 
provides  an  angular  dispersion  parameter  of  a  =  0.48  and  a 
lens  of  F  =  375  mm  focal  length.  The  four-wave  mixing  is  per¬ 
formed  by  cascaded  second-order  nonlinearities  in  a  2-mm-long 
type-II  BBO  crystal.  Several  experiments  were  conducted  to 
illustrate  this  real-time  spatial-temporal  processing  technique. 
All  the  output  waveforms  were  observed  with  a  time-to-space 


Time  in  picoseconds 


Fig.  7.  Ultrafast  waveforms  containing  the  correlation  information  of  two 
rectangular  apertures  in  the  spatial  domain.  Solid  curve:  ultrafast  waveform 
from  two  narrow  rectangular  spatial  signals  of  identical  width.  Dashed  curve: 
wider,  unequal  rectangular  spatial  signals,  resulting  in  a  trapezoid  correlation 
function. 

converter  based  on  three-wave  mixing  with  mutually  inverted 
SDW.  The  reference  pulse  source  required  for  the  pulse  imager 
was  the  residual  input  pulse  of  the  spatial-temporal  processor, 
after  separating  it  from  the  output  waveform  with  a  polarizing 
beam  splitter.  The  output  signal’s  intensity  was  viewed  with  a 
CCD  camera,  and  the  temporal  information  was  extracted  from 
the  image. 

The  first  experiment  consisted  of  placing  two  variable  slits  in 
the  spatial  channels,  implementing  a  rectangular  spatial  distri¬ 
bution.  The  correlation  function  of  two  rectangular  signals  has 
a  trapezoidal  shape;  the  plateau  is  observed  for  correlation  de¬ 
lays  smaller  than  the  width  of  the  narrower  rectangle.  When  the 
two  rectangles  are  of  identical  width,  the  correlation  function 
is  triangular.  If  the  width  of  one  of  the  rectangles  is  very  small 
(approaching  a  point  source),  then  the  output  signal  will  result 
in  a  space-to-time  conversion  and  resemble  a  square  pulse  (gen¬ 
eration  of  square  pulses  using  this  technique  reported  in  [10]). 
The  ultrafast  output  waveform  will  contain  smoothed  features 
due  to  the  convolution  operation  with  the  input  ultrashort  pulse. 
Additional  smoothing  is  performed  in  the  waveform  imaging 
operation.  The  experimental  results  of  Fig.  7  precisely  illus¬ 
trate  the  correlation  function  of  rectangular  signals  described 
above.  A  triangular  pulse  shape  was  observed  for  equal  rectan¬ 
gular  widths  (solid  curve),  while  a  trapezoidal  pulse  shape  was 
observed  for  unequal  rectangular  widths  (dashed  curve). 

A  second  experiment  was  performed  with  spatial  channels 
that  are  coded  with  pseudorandom  masks.  The  phase  mask  (for 
implementing  values  of  ±1)  was  prepared  by  wet-etching  the 
spatial  information  into  a  quartz  substrate.  The  spatial  informa¬ 
tion  consisted  of  contiguous  equal-width  rectangular  sections 
(each  75  pm)  encoded  with  transmission  values  of  ±  1  according 
to  a  127  maximal  length  sequence  [see  Fig.  8(a)].  The  max¬ 
imal  length  sequence  was  chosen  for  the  low  correlation  values 
it  exhibits  for  any  delay  other  than  zero.  When  the  mask  was 
placed  in  one  spatial  channel  and  the  second  consisted  of  a 
point  source,  a  space-to-time  conversion  was  performed  and 
the  resulting  waveform  appeared  random  [see  Fig.  8(b)].  When 
both  spatial  channels  had  an  identical  mask,  a  correlation  peak 
at  the  zero  delay  location  appeared  [see  Fig.  8(c)].  At  other 
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Fig.  9.  Ultrashort  pulse  trains  with  different  periods  to  generate  a  single-shot 
triple  correlation  measurement. 


pulses.  In  this  section,  a  single-shot  technique  based  on  ultra- 
short  pulse  trains  for  sampling  the  SDW  is  described. 

We  modify  the  technique  described  above  by  using  two  ul¬ 
trashort  pulse  trains  instead  of  the  two  ultrashort  pulses.  Gen¬ 
erating  a  pulse  train  of  identical  ultrashort  pulses  is  possible 
with  waveform  synthesis  techniques.  Additionally,  all  pulses  are 
assumed  to  have  an  identical  waveform.  The  period  between 
consecutive  pulses  is  different  for  the  two  pulse  trains.  Let  one 
pulse  train  have  pulse  separation  of  Ai  and  the  other  A2,  and 
let  Ai  %  A2  (see  Fig.  9).  The  resulting  output  field  is  given  by 
[see  (32)] 

UoxlX.{x":  t)  =  w 

x  -  mA2)j  exp(-jui0t).  (34) 

Since  the  pulse  separations  Ai  and  A2  are  nearly  equal,  Uo  ut 
will  be  zero  if  n  ^  m.  Therefore,  we  can  rewrite  (34)  as 


Fig.  8.  Ultrafast  spatial-temporal  processing  with  pseudorandom  information, 
(a)  Maximal  length  sequence  used  for  spatial  mask  information,  (b)  Synthesized 
space-to-time  conversion  signal  from  mask  using  a  point  source  for  second 
channel,  (c)  Correlation  function  of  two  identical  masks  carried  on  a  temporal 
waveform. 


t4ut(R/;  t)  =  ^2  w  p(t-  p(*  ~  n^i) 

x  p*(t  -  7iA2)  exp (~ju>0t).  (35) 


delay  times,  the  ultrafast  waveform  had  a  small  signal  from  the 
nonzero  correlation  values. 

This  type  of  processor  can  be  used  for  transmitting  spatial 
correlation  information  via  a  temporal  channel.  Such  capability 
may  be  important  for  transmission  of  information  from  a  hostile 
environment  for  detection  at  a  safer  location. 


VI.  Single-Shot  Implementation  of  the  Triple 
Autocorrelation 


The  observed  image  on  a  slow  detection  device  such  as  a  film 
or  camera  is  then  described  by 


Sout(R')  =  r  \Uout(x";t)\2dt 


Wave  mixing  an  SDW  and  two  ultrashort  pulses  resulted  in 
an  output  plane  image  that  contained  some  of  the  triple  correla¬ 
tion  values,  as  developed  in  Section  IV.  For  reconstruction  of  the 
original  intensity  profile  of  the  ultrashort  pulse,  the  entire  cor¬ 
relation  matrix  is  required.  Therefore,  a  sequence  of  measure¬ 
ments  is  necessary  to  fully  characterize  the  ultrashort  pulse  in¬ 
tensity.  The  parameter  that  is  changed  between  subsequent  mea¬ 
surements  is  the  relative  time  delay  between  the  two  ultrashort 


x  p(t  —  nAi)p*(t  —  n A2) 


s-t-SM*-2?) 

x  p*(t-  kAi)p(t  -  kA2)  J  dt.  (36) 
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Again,  since  the  pulse  separations  Ai  and  A2  are  nearly  equal, 
Sout  will  be  zero  if  n  ^  k.  Therefore 


Sout(x") 


=  V  f 


^  - 

n  J 

x  I(t-  nA2)dt 
cnAi 


w2  (  -- |  lit-  aX" 
a 


Iit-nAx) 


w~ 


)i:a -*) 


x  J(£  —  nAi)I(t  —  nAo)  dt 


(37) 


where  we  used  the  approximation  that  the  field  strength  varia¬ 
tion  due  to  the  spatial  beam  mode  does  not  change  for  the  short 
duration  of  each  sampling  pulse  at  time  t  =  n  Ai .  Let  us  assume 
that  A2  =  Ai  4-  e.  Additionally,  with  a  change  of  integration 
variable,  (37)  can  be  rewritten  as 


«*■)  =  £*'(—)  £/(?+nA,-2f) 


x  !(£)/(£  -  ne)  dt. 


(38) 


The  integral  in  (38)  is  now  identifiable  as  the  triple  autocorrela¬ 
tion  integral,  where  one  lag  parameter  is  mapped  to  the  spatial 
domain  while  the  second  lag  parameter  is  ne.  Each  order  of  n 
is  mapped  to  a  different  location  on  the  spatial  output  (centered 
at  xn  =  cn Ai /a)  and  is  associated  with  the  lag  he.  There¬ 
fore,  the  triple  correlation  matrix  values  are  raster  scanned  on 
the  1-D  output  space.  Each  order  n  is  also  scaled  by  the  input 
pupil  function.  This  effect  can  be  accounted  for  by  measuring 
the  profile  and  normalizing  the  measurement  or  by  ensuring  that 
the  mode  is  uniform.  After  collecting  the  1-D  correlation  values 
and  assembling  the  matrix,  an  algorithm  can  be  applied  to  find 
the  exact  intensity  profile  of  the  ultrashort  pulse.  The  sampling 
method  described  here  can  be  associated  with  bandwidth  adap¬ 
tation  techniques  [27],  where  a  2-D  distribution  is  converted  to 
a  1-D  distribution.  Such  bandwidth  adaptation  techniques  have 
also  been  applied  to  time-to-2-D  space  and  2-D  space-to-time 
conversions  [28]. 


VII.  Discussion  and  Conclusion 

We  have  examined  the  different  signal-processing  capabil¬ 
ities  that  are  enabled  when  spatial  and  temporal  information 
bearing  waves  interact  via  fast  parametric  processes.  The  me¬ 
thodical  process  of  investigating  all  the  input  wave  combina¬ 
tions  describes  the  experiments  we  have  conducted  in  the  past 
and  reveals  new  options  for  processing  of  ultrafast  data.  These 
techniques  can  be  used  for  synthesis,  processing,  and  detec¬ 
tion  of  ultrafast  waveforms.  Such  capabilities  may  be  utilized  in 
future  high-capacity  communication  systems,  ultrafast  optical 
computation,  and  scientific  uses  in  investigation  of  ultrafast  phe¬ 
nomena  and  quantum  control. 

As  our  analysis  showed,  there  are  many  methods  one  may 
choose  for  conversion  of  data  from  the  time-to-space  domain: 
time-domain  noncollinear  arrangement,  mixing  the  time  do¬ 
main  with  the  SDW  (or  at  the  image  plane  of  the  grating),  and 
mixing  two  mutually  inverted  SDWs.  All  methods  generate  the 


intensity  cross-correlation  signal  with  a  reference  transform 
limited  pulse  [compare  (18)  and  the  intensity  of  (7),  for 
example].  The  different  arrangements  offer  functional  trade¬ 
offs  among  time  window  of  apparatus,  phase  matching,  and 
walkoff  effects;  crystal  (or  interaction)  length;  and  conversion 
efficiency.  However,  it  was  not  within  the  scope  of  this  paper 
to  analyze  which  technique  offers  the  best  performance  for  a 
given  set  of  constraints. 

The  instantaneous  nature  of  wave-mixing  processes  for 
information  conversion  from  the  spatial  domain  to  the  temporal 
domain  is  best  utilized  when  the  spatial  information  channel 
is  rapidly  changing.  In  our  analysis,  we  have  assumed  that  the 
spatial  information  is  stationary  with  respect  to  the  temporal 
channel  time  window.  This  could  be  accomplished  by  a  1-D 
fiber  array,  which  supplies  synchronized  parallel  data  streams 
(for  such  telecom-oriented  applications,  the  wave-mixing 
process  has  to  be  optimized  for  the  expected  low  power  levels). 
If  the  spatial  channel  information  changes  during  the  time 
window  of  the  spectral  processor,  the  output  signal  will  exhibit 
different  temporal  information  across  its  transverse  extent.  It 
should  be  noted  that  when  the  spatial  information  is  varying  at 
slow  rates,  alternative  Fourier  plane  processing  elements  such 
as  photoreffactives  may  be  employed  (spectral  holography 
[29]). 

Ultrafast  waveform  synthesis  by  space-to-time  conversion 
can  be  performed  by  three-  and  four-wave-mixing  arrange¬ 
ments.  The  degenerate  four-wave-mixing  arrangement  is  the 
more  useful  option,  as  it  is  usually  desirable  to  modify  an  input 
ultrashort  pulse  to  a  prescribed  waveform  without  a  center 
frequency  shift.  However,  since  all  the  processing  is  performed 
by  real-time  parametric  interaction,  it  is  possible  to  add  a  wave- 
length-tuning  feature.  When  utilizing  the  four-wave-mixing 
process  with  tunable  lasers  for  implementation  of  the  spatial 
channels,  the  center  frequency  of  the  synthesized  waveform 
will  be  shifted  by  the  frequency  difference  of  the  two  spatial 
lasers,  in  accordance  with  the  energy  conservation  principle. 
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Design  and  Evaluation  of  the 
Fibonacci  Optical  ATM  Switch 

Allen  Shum,  P.M.  Melliar-Smith,  and  Louise  E.  Moser 

Abstract— An  ATM  switch  architecture  that  uses  optical  delay  fibers  as  buffers  is  presented.  Based  on  the  remarkable  properties  of 
the  Fibonacci  numbers,  the  architecture  requires  few  fibers  and  yields  low  cell  loss  rates  and  latencies  across  diverse  time-correlated 
and  heavily  loaded  traffic  profiles.  Moreover,  the  control  algorithm  of  the  architecture  is  computationally  efficient  and  can  be  executed 
in  real  time.  Simulation  studies  indicate  that  the  cell  loss  rate  decreases  exponentially  as  the  number  of  fibers  increases  and  that  the 
probability  mass  function  for  the  cell  latency  decays  rapidly  as  the  latency  increases. 

Index  Terms— Asynchronous  transfer  mode,  optical  ATM  switch. 

-  ♦  - 


1  Introduction 

oday'S  electronic  ATM  switches  are  the  bottleneck  of  an 
ATM  network:  ATM  cells  are  not  switched  fast  enough 
to  match  the  transmission  speed  of  the  optical  channels.  It  is 
expected  that  the  gap  between  the  electronic  switching  and 
optical  transmission  speeds  will  continue  to  increase  as 
optical  transmission  technology  matures.  A  solution  to  this 
problem  is  to  implement  ATM  switches  using  optical 
technology. 

The  major  functions  of  an  ATM  switch  are  to  route 
incoming  cells  to  the  appropriate  output  channels  and  to 
provide  buffering  for  them,  if  necessary.  The  construction  of 
a  single  monolithic  fabric  to  interconnect  large  numbers  of 
input  and  output  channels  may  not  be  feasible  due  to 
physical  and  cost  contraints.  Consequently,  the  switching 
fabric  of  a  high-performance  ATM  switch  typically  consists 
of  a  network  of  smaller  switching  elements,  as  shown  in 
Fig.  1,  with  each  small  switching  element  having  its  own 
buffers,  optical  crossbar  switch,  and  control  unit,  as  shown 
in  Fig.  2.  Such  an  organization  lends  itself  naturally  to 
parallel  processing  of  cells  and  parallel  routing  decisions. 

In  optical  switches,  static  buffering  is  not  feasible  and 
buffers  are  constructed  by  switching  ATM  cells  into  optical 
fibers  that  act  as  delay  lines.  The  longer  the  fiber,  the  longer 
the  delay  and  the  greater  the  number  of  cells  buffered  in  the 
fiber.  By  switching  an  ATM  cell  through  a  sequence  of  fiber 
delay  lines,  any  required  delay  can  be  achieved.  The 
difficulty  of  building  an  optical  crossbar  switch  increases 
rapidly  with  the  number  of  fibers.  Consequently,  it  is 
important  to  minimize  the  number  of  fiber  delay  lines  while 
still  achieving  efficient  operation  of  the  switch.  Although 
packet  switch  architectures  using  optical  delay  fibers  as 
buffers  have  been  proposed  in  the  literature  [4],  [5],  [7],  [10], 
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[12],  as  far  as  we  can  determine,  studies  on  how  to  design 
an  optical  switch  that  addresses  ATM-specific  requirements 
while  requiring  few  fibers  and  low  processing  complexity 
do  not  exist. 

In  this  paper,  we  present  the  design  of  a  novel  optical 
ATM  switch  architecture.  The  Fibonacci  buffer  organization 
of  the  switch  uses  fewer  fibers  to  achieve  lower  cell  loss 
rates  under  diverse  time-correlated  and  heavily  loaded 
traffic  profiles  than  existing  buffer  organizations.  Moreover, 
the  control  algorithm  is  computationally  efficient  and  can 
be  executed  in  real  time.  The  architecture  guarantees  that 
the  cell  sequence  integrity  of  each  input-output  channel  pair 
is  maintained  and  that  the  maximum  latency  is  below  a 
specified  threshold. 

This  paper  is  organized  as  follows:  In  Section  2,  we 
identify  the  issues  and  challenges  in  designing  an  optical 
ATM  buffer  architecture.  In  Section  3,  we  present  our 
optical  ATM  switch  architecture,  followed  by  an  analysis  of 
the  time  and  space  complexity  of  the  design.  In  Section  4, 
we  present  the  performance  results  of  the  architecture,  as 
indicated  by  simulation  studies,  and  also  a  heuristic  that 
can  determine  an  empirically  lossless  optical  ATM  switch. 
In  Section  5,  we  provide  a  discussion  of  related  work  and,  in 
Section  6,  we  summarize  the  results  of  our  work. 

2  Problem  Statement 

In  an  optical  ATM  switch,  the  buffering  of  a  cell  for  k  time 
slots  is  accomplished  by  causing  the  cell  to  traverse  a 
sequence  of  optical  fibers  such  that  the  propagation  delay 
over  the  fibers  is  exactly  k  time  slots.  A  fiber  is  of  length  i  if 
the  time  required  to  propagate  a  cell  through  the  fiber  is 
exactly  i  time  slots. 

The  three  components  of  an  ATM  buffer  organization  are 
the  buffer  set,  the  switching  fabric,  and  the  control 
algorithm.  The  buffer  set  refers  to  the  set  of  optical  delay 
fibers  used  for  buffering.  The  switching  fabric  is  the  structure 
that  physically  routes  cells  from  the  input  to  the  output, 
e.g.,  a  crossbar.  The  control  algorithm  is  the  procedure  that 
determines  how  cells  are  stored  or  propagated  through  the 
switching  fabric. 
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Fig.  1 .  A  4-input  and  4-output  switch  fabric  with  two  stages  of  2  x  2 
switching  elements  and  an  8-input  and  8-output  switch  fabric  with  three 
stages  of  2  x  2  switching  elements. 

Cell  loss  refers  to  the  discarding  of  a  cell  when  a  sequence 
of  fibers  to  buffer  the  cell  cannot  be  found.  Latency,  in  our 
context,  refers  to  the  queuing  delay  that  cells  may 
experience  at  the  switch.  An  arrival  process  or  traffic  profile 
describes  the  arrivals  of  cells  at  the  input  channels  and  at 
their  destination  output  channels. 

Because  the  function  of  a  buffer  is  to  prevent  cell  loss 
during  periods  of  traffic  overloads,  the  most  important 
performance  metric  of  an  ATM  buffer  architecture  is  the  cell 
loss  rate.  Unlike  other  packet-switched  networks,  which  are 
designed  to  support  data  applications  that  do  not  have 
stringent  delay  requirements,  an  AIM  network  must  also 
transport  real-time  applications;  consequently,  low  predict¬ 
able  latency  is  a  typical  requirement  of  an  ATM  switch.  In 
an  ATM  network,  the  order  in  which  cells  are  transmitted 
over  a  virtual  connection,  i.e.,  the  cell  sequence  integrity,  must 
be  maintained.  In  other  words,  the  goal  of  an  ATM  buffer 
organization  design  is  to  find  a  switching  fabric,  a  buffer 
set,  and  a  control  algorithm  so  that  the  cell  loss,  latency,  and 
cell  sequence  integrity  requirements  are  satisfied  while 
requiring  low  processing  complexity  and  a  small  number  of 
fibers. 

The  design  of  an  optical  ATM  buffer  organization  poses 
several  challenges: 

1.  Minimizing  cell  loss  and  latency.  A  low  cell  loss  rate 
can  be  achieved  by  providing  a  large  buffer,  but  this 
would  entail  large  latencies  at  the  switch;  conver- 
sely,  a  small  buffer  can  keep  cell  latency  low  but  may 
result  in  excessive  cell  loss.  Maintaining  low  latency 
and  minimizing  cell  loss  are,  therefore,  conflicting 
objectives.  In  the  design  of  an  optical  ATM  buffer 
organization,  using  long  fibers  tends  to  reduce  cell 
loss  without  increasing  the  number  of  fibers  since 
longer  fibers  can  accommodate  more  cells,  e.g.,  a 
length-/;  fiber  can  store  up  to  k  cells  at  any  particular 
time;  however,  longer  fibers  introduce  or  result  in 
greater  delays  as  the  transit  delay  through  a  length-/: 
fiber  is  k  time  slots.  Simultaneously  satisfying  cell 


Fig.  2.  A  2  x  2  optical  switching  element. 

loss  and  latency  requirements  with  few  fibers  is 
difficult. 

2.  Preserving  cell  sequence  integrity.  Maintaining  the 
order  in  which  cells  arrive  and  depart  on  each 
input-output  channel  pair  severely  constrains  how 
an  ATM  switch  can  be  organized.  The  switching 
fabric  of  a  high-performance  switch  typically  con¬ 
sists  of  a  network  of  smaller  elements,  with  each 
switching  element  having  its  own  buffer  organiza¬ 
tion  and  processing  unit.  To  preserve  the  cell 
sequence  integrity,  either  all  cells  on  each  input- 
output  channel  pair  traverse  the  same  sequence  of 
switching  elements  or  a  reordering  mechanism  at  the 
edge  of  the  switching  fabric  must  be  employed  to 
resequence  out-of-order  cells.  The  traversing  of  a 
unique  path  requires  that  sufficient  buffering  capa¬ 
city  be  available  at  each  switching  element. 

For  time-correlated  and  heavily  loaded  traffic,  a 
large  buffer  must  be  provided  at  each  switching 
element  to  prevent  excessive  cell  loss.  A  large  buffer 
introduces  higher  latency  and  results  in  higher  cost. 
Much  of  the  deleterious  effect  of  time-correlated 
traffic  over  an  input-output  channel  pair  can  be 
mitigated  by  distributing  the  traffic  over  several 
routes,  but  this  results  in  out-of-order  cells.  The 
mechanism  that  resequences  out-of-order  cells  typi¬ 
cally  requires  timestamps,  extra  buffer  capacity  at 
the  edge  of  the  switching  fabric,  and  delaying  of 
cells.  ^ 

3.  Limiting  factors.  The  limiting  factors  to  implementing 
an  ATM  buffer  organization  are  that  the  number  of 
fibers  must  be  small  and  that  the  processing 
requirement  must  be  low.  These  are  conflicting 
factors  since  the  use  of  fewer  fibers  requires  the 
use  of  a  sophisticated  control  algorithm  so  that  the 
few  fibers  are  used  efficiently.  Sophisticated  control 
algorithms  tend  to  have  higher  time  and  space 
complexity.  Designing  a  low-complexity  control 
algorithm  that  can  efficiently  utilize  few  fibers  is 
not  easy. 

4.  Performance  evaluation.  Performance  evaluation  of  an 
ATM  switch  poses  another  challenge.  Mathematical 
analysis  of  the  behavior  of  an  ATM  switch  is 
typically  intractable.  Simulation,  the  usual  alterna¬ 
tive  to  evaluating  mathematically  intractable  de- 
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Fig.  3.  An  M input  and  M-output  ATM  switch  with  an  optical  buffer  set. 

signs,  also  faces  significant  difficulty  since  the 
primary  performance  measure  of  interest  for  an 
ATM  switch  involves  the  sampling  of  statistically 
rare  events,  i.e.,  cell  loss.  For  example,  the  usual  cell 
loss  requirement  of  an  ATM  switch  is  on  the  order  of 
10-8.  To  ascertain  whether  an  ATM  architecture 
would  satisfy  such  a  criterion  requires  the  genera¬ 
tion  of  at  least  1010  ATM  cells  in  a  simulation  run. 
The  computational  cost  of  such  runs  is  prohibitive. 

3  The  Optical  ATM  Buffer  Organization 

We  address  the  design  of  a  small  switching  element,  as 
shown  in  Fig.  2.  Our  analysis  focuses  on  the  buffer 
organization  and  cell  loss  rate  as  a  function  of  the  number 
of  fibers.  This  allows  us  to  determine  the  minimum  number 
of  fibers  necessary  to  achieve  acceptable  performance  from 
the  switch. 

3.1  The  Optical  ATM  Switch  Model 

The  ATM  switch  model  that  we  use  to  evaluate  our  design 
has  N  input  and  M  output  channels,  which  are  labeled  0 
through  N  —  1  and  0  through  M  -  1,  respectively,  as  shown 
in  Fig.  3.  The  channels  are  synchronized  to  a  common  time 
line,  which  is  divided  into  infinitely  many  consecutive  time 
slots  of  unit  duration  corresponding  to  the  transmission 
time  of  a  single  ATM  cell.  Cell  arrivals  to  the  input  channels 
occur  only  at  the  beginning  of  a  time  slot.  A  cell  arriving  on 
input  channel  i  is  routed  instantaneously  to  output  channel 
j  with  probability  qij,  where  0  <  i  <  N  -  1,  0  <  j  <  M  -  1, 
qij  >  0.0,  and  qtJ  =  1  for  0  <  i  <  N  —  1.  A  cell  is 

transmitted  on  an  output  channel  only  at  the  beginning  of  a 
time  slot  and  departs  the  switch  at  the  end  of  the  same  time 
slot.  An  arriving  cell  to  an  output  channel  is  transmitted  on 
the  channel  if  the  channel  is  idle;  otherwise,  it  traverses  the 
switching  fabric  and  remains  there  until  the  output  channel 
becomes  available,  i.e.,  all  prior  arriving  cells  from  the  same 
input  channel  have  been  transmitted. 

Cells  arrive  on  input  channel  i  according  to  an 
independent  2-state  Markov-modulated  Bernoulli  process, 
MMBPi,  for  0<2<JV-1.  An  MMBPi  input  stream 
transitions  among  the  two  states  s\  and  s\.  At  the  beginning 
of  each  time  slot,  input  stream  i  moves  from  state  s*  to  slk 
with  probability  azjk  >  0,  where  1  <  j.  k  <  2  and  a*  x  4-  alj2  = 
1  for  j  =  1  and  2.  In  state  s[,  the  input  stream  i  generates  a 
cell  at  the  beginning  of  a  time  slot  and,  in  state  s2,  the  input 
stream  generates  nothing.  A  state  transition  diagram  of  a 
two-state  MM  BP  input  stream  is  shown  in  Fig.  4. 

A  phenomenon  common  to  all  discrete-time,  finite-buffer 
ATM  switches  with  multiple  input  and  output  channels  is 
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Fig.  4.  Transition  diagram  of  a  2-state  MMBP. 

that  many  cells  can  arrive  simultaneously  at  the  switch  and  it 
is  possible  that  not  all  of  the  cells  can  be  accommodated  in 
the  buffer.  There  must  be  a  protocol,  referred  to  as  an 
arbitration  scheme ,  to  determine  how  the  arriving  cells  are  to 
be  discarded.  The  arbitration  scheme  is  such  that  a  cell  from 
input  channel  i  is  admitted  into  the  buffer  before  a  cell  from 
input  channel  j  if  i  <  j.  We  evaluate  the  performance  of  our 
design  using  this  ATM  multiplexer  model.  The  simulation 
results  are  given  in  Section  4. 

3.2  The  Architecture 

We  now  present  our  optical  ATM  buffer  organization  and 
the  rationale  behind  its  design.  Because  the  efficacy  of  our 
optical  ATM  switch  is  due  to  the  remarkable  properties  of 
the  Fibonacci  numbers,  it  is  referred  to  as  the  Fibonacci 
Optical  ATM  Switch.  To  facilitate  the  presentation,  we 
introduce  the  following  variables: 

•  m,  the  number  of  distinct  fiber  types, 

•  Lit  the  length  of  a  type-i  fiber,  where  1  <  i  <  m, 

•  n  =  (ni? . . . ,  n™),  where  n*  is  the  number  of  fibers  of 
length  Lif 

•  n fib^s  =  the  total  number  of  optical  delay 

fibers, 

•  Dmax,  the  maximum  latency  of  the  switch. 

The  buffer  set,  the  switching  fabric,  and  the  control 
algorithm  of  the  architecture  are  as  follows: 

1 .  Buffer  Set 

a.  The  number  m  of  fiber  types .  m  is  set  to 

lOQio{\/S{Dmax  f))  ^ 

where  <p  =  and  Dmax  is  the  latency  require¬ 
ment  of  the  switch.  The  justification  for  setting 
this  value  to  m  can  be  found  in  Appendix  A. 

b.  The  type-i  fiber  length  Li,  where  1  <  i  <  m.  Li  is  set 
to  Fi+ 1,  where  Fi+i  is  the  (i  +  l)st  term  of  the 
Fibonacci  sequence  defined  by  Fi+\  =  Ft  4-  F}-i, 

=  0,  and  Fi  =  1.  This  buffer  set  is  referred  to 
as  the  Fibonacci  buffer  set. 

c.  The  number  ni  of  type-i  fibers,  where  1  <i  <m.  rii 
is  set  to  an  integer  that  is  greater  than  or  equal  to 
1.  While  m  and  Lt  are  uniquely  determined  by 
the  latency  requirement  Dmax,  the  proper  choice 
of  rii  depends  on  the  cell  loss  requirement  of  the 
switch  and  can  be  determined  through  simula¬ 
tion  studies. 

The  selection  of  m  and  Li  is  based  on  the  following 
observations: 

a.  The  fiber  lengths  should  be  exponentially  increasing. 
This  corresponds  to  requiring  that  L,  is  on  the 
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order  of  7',  where  7  >  y/2.  This  requirement  is 
due  to  the  fact  that  longer  fibers  can  retain  more 
cells.  As  a  result  of  using  exponentially  increas¬ 
ing  fiber  lengths,  the  addition  of  one  fiber  can 
significantly  increase  the  capacity  of  the  optical 
buffer. 

b.  The  fiber  lengths  should  be  such  that  there  exists  a 
binary  representation  (of  these  fiber  lengths)  for  each 
integer  between  1  and  Dnuix,  inclusive.  This  offers  a 
means  of  finding  a  length-??  fiber  sequence 
efficiently.  To  minimize  cell  latencies,  a  control 
algorithm  must  try  to  find  a  length-^  fiber 
sequence  to  buffer  a  cell  that  must  be  delayed 
for  at  least  n  time  slots;  however,  the  number  of 
possible  length-n  fiber  sequences  is  typically  an 
exponential  function  of  h , . . . ,  Lrn ,  m,  and  n  (see 
Appendix  B).  This  suggests,  that  we  cannot 
consider  all  possible  combinations  of  fiber 
sequences  of  length  n.  An  alternative  is  to 
permit  a  fiber  of  each  length  to  be  included  in 
the  sequence  at  most  once,  i.e.,  by  representing 
n  as  n  =  VjLj,  where  <7,- €{0.1},  and 
including  a  fiber  of  length  Lj  in  the  fiber 
sequence  only  when  oj  —  1.  We  say  that  a  = 

'  *  •  cti  is  a  binary  representation  of  n  on 
Lu...,Lrn.  A  buffer  set  consisting  of  fibers  of 
lengths  Li ,  Lo. . . . ,  Lw  is  referred  to  as  a  flexible 
buffer  set  if,  for  every  integer  n,  1  <  i  <  Dmax, 
there  exists  an  m-digit  binary  representation  or 
sequence  amavi-i  for  n. 

While  several  binary  representations  of  an 
integer  on  a  Fibonacci  buffer  set  may  be 
possible,  one  particular  binary  representa¬ 
tion — the  "greedy"  Fibonacci  representation — is 
used  by  the  control  algorithm  of  our  architec¬ 
ture.  The  greedy  representation  crmaTn-i  •  •  •  of 
n  €  [1,  Dmax]  on  LJtl,  . . . ,  L\  is  determined 
recursively  as  follows:  If  Fk+1  <  n  and  Fk+2  >  n, 
then  set  <rTn .  crr]t_i,  . . . ,  &k+i  to  0  and  ok  to  1.  The 
greedy  representation  of  n  -  Fk+i  on 
Lk-i ,  Lk„ 2; . . . ,  L\  determines  ok-\,  ok- 2, . . . ,  o\. 
For  example,  the  greedy  Fibonacci  representa¬ 
tion  of  the  integer  10  on  fiber  lengths  1,  2,  3,  5,  8, 

13  is  010010.  We  prove  (see  Appendix  C)  that 
the  greedy  representation  always  exists  and  that 
no  adjacent  Is  are  possible  in  the  representation. 
Consequently,  a  greedy  Fibonacci  representa¬ 
tion  consists  of  at  most  |f]  Is  and,  typically, 
very  few  Is  for  a  wide  range  of  integers. 

Although  the  2-exponential  buffer  set,  i.e., 
with  fiber  lengths  1. 2. 4. 8. . . . .  2ni“i,  also  has 
exponentially  increasing  fiber  lengths  and  is 
flexible,  the  property  of  requiring  very  few 
fibers  to  construct  a  fiber  sequence  makes  the 
Fibonacci  buffer  set  preferable  to  the  2-expo¬ 
nential  buffer  set  in  the  context  of  our  architec¬ 
ture.  A  comparison  of  the  two  flexible  buffer  sets 
can  be  found  in  Section  4.4. 


2.  Control  Algorithm.  The  control  algorithm  consists  of  a 
set  of  data  structures,  a  fiber  sequence  selection 
algorithm,  and  an  update  procedure,  as  follows: 

a.  Data  Structures 

i-  where  0  <  z  <  jV  -  1  and 

0  <  j  <  M  —  l.  Dmi„[i,j ]  at  time  n  is  the 
minimium  delay  that  a  cell  arriving  at 
input  i  and  departing  at  output  j  must  stay 
within  the  buffer.  This  preserves  the  cell 
sequence  integrity  over  input  i  and  output  j. 

ii.  T[i,j],  where  0  <  i  <  Dmax  -  1  and 
1  <  j  <  n fibers*  T  at  time  n  is  referred  to  as 
the  insertion  table  at  time  n.  T[i,j]  may  take 
on  one  of  two  values:  OCCUPIED  and 
EMPTY.  T[i, j]  =  OCCUPIED  indicates 
that  a  cell  will  be  inserted  into  the  fiber  j 
at  time  n  4-  i. 

iii.  Transmission^, j\,  where  0  <i<Drnax 
and  0  <  j  <  M  —  1.  Transmission^,  j] 
takes  on  a  value  of  either  YES  or  NO. 
Transmission^,  j]  =  YES  at  time  n  indi¬ 
cates  that  a  cell  will  be  transmitted  on 
output  channel  j  at  time  n  +  z  and 
Transmission^,  j]  =  NO  otherwise. 

b.  Fiber  Sequence  Selection  Algorithm 

i.  Tuning  Parameters.  The  fiber  sequence  selec¬ 
tion  algorithm,  which  is  described  in  Fig.  5, 
requires  the  following  tunable  parameters: 

A.  kMaxPemt-  Let  ^  be  the 

binary  representation  of  the  integer  k, 
i.e.,  k  =  <?jFj+i  and  <jj  €  {0, 1}, 

and  at, .  ok2 . <Jk,  be  the  nonzero 

elements  of  a.  There  are  j\  permuta¬ 
tions  of  length-k  fiber  sequences.  This 
suggests  that  the  algorithm,  in  spite  of 
considering  only  the  binary  represen¬ 
tation  on  the  fiber  lengths,  needs  to 
examine  up  to  j\  permutations  to 
determine  whether  a  cell  can  be 
admitted  to  the  buffer.  kMaxP€rm, 
where  1  <  kMaxPemt  <  m  restricts 
the  maximum  number  of  possible 
length-^  fiber  sequences  that  die  algo¬ 
rithm  examines  before  making  a  deci¬ 
sion  on  whether  an  unused  length-A: 
fiber  sequence  exists. 

B.  h  Max  Backoff  *  Suppose  that  a  cell  destined 
to  output  j  arrives  on  input  channel  i  at 
time  n  and  that  it  needs  to  be  delayed 
for  at  least  k  time  slots.  The  algorithm 
first  finds  an  integer  m!  such  that  m!  > 
k  and  there  is  no  scheduled  transmis¬ 
sion  on  the  output  j  at  time  n  +  mf.  It 
then  determines  whether  there  exists  a 
fiber  sequence  of  length  m! .  If  so,  the 
cell  is  admitted;  otherwise,  the  algo¬ 
rithm  attempts  to  find  an  integer  l  >  vn! 
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FindFiborSequeiicc  (dmf*n,  j  ) 

Begin 

A 7  umbe  r  BackO  f  f  —  0: 

Found  Fiber  Sequence  —  NO: 

If  k algorithm  =  Always  tllGll 

Begin 

Set  k  to  be  the  smallest  integer  such  that 
d-min  —  k 

Tran$mi$sion\k*j]  —  NO  and 

The  greedy  Fibonacci  representation  of  k  consists,  of  at  most  k\farFibers  l's; 

End 

Else 

Begin 

Set  k  to  be  the  smallest  integer  such  that 
d-min  Fi  k  and 

Tro.  n  sm  i  .s.s  ion[k.j]  —  NO: 

End: 

While  {(Found  Fiber  Sequence  =  NO)  and  (k  <  Dmax  )  and  (N  umber  BackO  f  f  <  kMaxBackOff  )) 
Begin 

Let  o  =  om  . .  .fTj  be  the  greedy  Fibonacci  representation  of  k 

Let  _ <Tkri  be  the  nonzero  elements  of  a 

N  umPcrm  Examined  —  0: 

Found  Permutation  —  NO: 

While  ((Ar umPerm Examined  <  min(kjijaxptrm. n!))  and  (Found Permutation  =  A'O)) 

Begin 

Num Perm  Examined  —  NumPev  Examined  +  1; 

Select,  a  permutation  on  Lkn .  - -  L/t,  .  say.  /,*n  ; 

Determine  whether  there  exists  /  =  {/*„.  —  4E  where  4,  is  a  fiber  of  length 

such  that  each  of  T[0.4J.  T[Lk„Jk„^i] . ^E?=2  is  ^qual  to  EMPT\  ; 

If  /  exists  then 
Begin 

Schedule  the  cell  to  traverse  the  fiber  sequence  4„4„_i  *  *  *4*i? 

^mm[f  j]  “  fr; 

j']  —  IlES: 

Set  each  of  T{ 0.4n]«  r[Zjtn.4-n_i] - to  OCCUPIED ; 

Found  Permutation  —  YES: 

Found  Fiber  Sequence  —  YES: 

End; 

End: 

If  (Found  Fiber  Sequence  =  ArO)  then 
Begin 

Number  BackO  f  f  —  Number  BackO  ff  +  1; 

Set,  &  —  ATca:#Va/ue(Ar,j).  where  N extValue(kJ)  is  the  smallest  integer  /  such  that 

A'  <  / 

Tran.sm?*s$?7m[/, j]  =  ArO  and 

The  greedy  Fibonacci  representation  of  /  requires  no  more  than  kMaxFibers 

End: 

End: 

End. 


Fig.  5.  Fiber  sequence  selection. 

such  that  there  is  no  output  collision  at 
time  n  +  l  and  there  exists  a  fiber 
sequence  of  length  l  to  accommodate 
the  cell.  This  is  refered  to  as  backoff. 
Although  backing  off  can  accommo¬ 
date  a  cell  that  is  otherwise  discarded, 
it  has  a  serious  deleterious  effect:  The 


subsequent  arriving  cells  of  the  (i,j) 
input-output  channel  pair  at  time  n  + 
1  must  be  delayed  by  Z,  rather  than 
ml ,  time  slots.  Since  a  cell  requiring  a 
larger  delay  typically  needs  to  use 
more  fibers,  the  performance  of  the 
architecture  suffers  as  the  number  of 
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backoffs  increases.  kuaxDuckOjfr  which 
can  take  on  any  value  between  1  and 
A/mx  -  I,  restricts  the  number  of  times 
that  the  control  algorithm  backs  off  to 
accommodate  an  arriving  cell. 
k  Max  Fibers*  Although  the  maximum 
number  k  of  fibers  required  to  represent 
an  integer  between  1  and  Dmax  is  ],  k 
becomes  large  as  m  increases.  k\/uxFibers 
limits  the  maximum  number  of  fibers 
that  can  be  used  to  accommodate  a  cell. 
The  effect  of  kMaxFibers  is  as  follows: 
Suppose  that  a  cell  needs  to  be 
delayed  for  at  least  k\  time  slots, 
but  the  greedy  Fibonacci  representa¬ 
tion  of  ki  requires  n  Is,  where 
71  >  k  Max  Fibers*  The  algorithm  must 
find  the  smallest  integer  ko  such  that 
ko  >  ki  and  that  its  binary  represen¬ 
tation  consists  of  no  more  than 
kMux.Fibers  Is.  A  consequence  of  using 
kMaxFibers  is  an  increase  in  the  delay 
of  the  cell.  Because  of  the  cell 
sequence  integrity  requirement,  the 
delays  of  subsequently  arriving  cells 
also  increase.  kMaxFibers  can  be  used 
in  two  contexts,  depending  on 
whether  the  parameter  kaigorithm  is 
set  to  Always  or  BackOffOnly.  If 
kaigorithm  —  BackOff Only,  the  para¬ 
meter  kMaxFibers  is  used  only  when  the 
algorithm  is  performing  a  backoff.  If 
kaigorithm  =  Always,  each  buffered  cell 
must  always  traverse  no  more  than 
kMaxFibers  fibers.  kMaxFibers  may  take  on 
any  value  between  1  and  inclusive. 

D.  kaigoritfwl.  kaigorit^,  takes  on  one  of  two 
values:  Always  and  BackOffOnly.  This 
parameter  specifies  whether  the  restric¬ 
tion  of  using  kMaxFibers  fibers  in  a  fiber 
sequence  is  always  used  or  is  used  only 
during  a  backoff. 

ii.  The  Algorithm.  Upon  receiving  a  cell  that 
has  arrived  on  input  i  and  is  destined  to 
output  j,  the  algorithm  examines  Dmiri  [i.j] 
and  Transmission  [0,  j]  to  determine 
whether  the  cell  needs  to  be  buffered.  If 
Transmission^,  j)  =  YES  or  Z>min[z\  j]  >  0, 
the  cell  must  be  buffered.  The  minimum 
time  dmir,  that  the  cell  needs  to  be  buffered 
is  rnax{l.  Dmir\i,j]}.  The  FindFiberSe- 
quence  procedure  with  input  parameters 
dmin  and  j  is  invoked  to  find  a  length-A: 
fiber  sequence  such  that  dmiTi  <  k  and 
Transmission^,  j]  =  NO ;  that  is,  the  proce¬ 
dure  finds  a  fiber  sequence  of  length  k  such 
that  dmin  <  k  so  as  to  maintain  cell  sequence 
integrity  and  such  that  no  other  cell  has 


been  scheduled  to  be  transmitted  on  output 
channel  j  at  time  k.  The  procedure  is  shown 
in  Fig.  5. 

iii.  The  Update  Algorithm.  At  the  beginning  of 
each  time  slot,  the  data  structures  Dmin,  T, 
and  Transmission  are  updated  as  follows: 

Dmin[i,j]  <-  max( 0,  Dmin[iJ\  -  1) 
for  each  (z,j)-pair  such  that 
0  <  i  <  N  -  1  and  0  <  j  <  M  -  1 
T[i,j]  4-  T[i  +  1  ,j]  for  0  <  i  <  Drnax  -  1 
and  T[Dmax  -  1,  j]  <-  EMPTY , 
where  1  <  j  <  nfiber. 

Transmission^,  j\  Transmission [i  +  1,  j], 

for  0  <  i  <  Dmax  -  1 
and  Transmission[Dmax:  j]  4—  NO, 
where  0  <  j  <  M  —  1. 

Switching  Fabric.  The  switching  fabric  of  the  ATM 
switch  transfers  a  cell  arriving  on  an  input  channel 
or  exiting  a  delay  fiber  to  an  output  channel  or 
another  delay  fiber.  Any  nonblocking  (N  +  njlbers)  x 
(M  4-  n fibers)  switch,  such  as  a  crossbar,  as  shown  in 
Fig.  1  for  M  =  N  =  2  or  a  Benes  network  [11],  can  be 
used  as  the  switching  fabric. 

Example  of  Operation 


3.3 

We  now  illustrate  the  basic  operation  of  the  control 
algorithm  and,  in  particular,  of  the  FindFiberSequence 
procedure.  Let  N  =  M  =  2,  kMaxFibers  =  3,  and  Dmax  =  10. 
Then,  m  =  5,  h  =  1,  L2  =  2,  T3  =  3,  LA  =  5,  and  T5  =  8.  Let 
rii  =  1,  n2  =  1,  ri3  =  1,  n4  =  1,  and  n$  =  1.  Suppose  that,  at 
time  n,  the  states  of  the  data  structures  Dmin/  T,  and 
Transmission  are  as  shown  in  Fig.  6  and  that  a  cell  that  is 
destined  for  output  0  arrives  at  input  1.  Because 
Drain  [1.0]  =3.  the  cell  must  be  buffered  for  at  least  three 
time  slots.  Time  n  +  3  would  be  the  earliest  time  that  the  cell 
may  be  transmitted  at  output  0,  but  Transmission^,  0]  =  X 
indicates  that  a  cell  has  already  been  scheduled  to  use  the 
output  at  time  n  +  3.  Transmission [4, 0]  =  EMPTY  indi¬ 
cates  that  the  output  is  idle  at  time  n  +  4  and  the  cell  may  be 
transmitted  at  that  time.  This  implies  that  the  cell  must  be 
buffered  for  four  time  slots. 

Because  the  greedy  Fibonacci  representation  of  four  on 
fiber  lengths  1, 2, 3, 5,  and  8  is  00101,  the  cell  must  traverse  a 
fiber  sequence  consisting  of  fibers  of  lengths  1  and  3.  There 
are  two  such  fiber  sequences,  hl3  and  l3lu  If  the  cell  were  to 
traverse  the  sequence  hl3,  T[0. 1]  and  T[l,3]  would  have  to 
be  empty  in  order  for  the  cell  to  be  inserted  into  fiber  1  at 
time  n  and  into  fiber  3  at  time  n  + 1.  T[  1. 3]  =  X  and 
T[0.3]  =  X  indicate  that  neither  of  the  two  sequences  l3l3 
nor  l3li  can  be  used.  The  control  algorithm  must  back  off  to 
another  integer  larger  than  4.  Transmission [5, 0]  = 
EMPTY  indicates  that  the  cell  may  be  transmitted  on 
output  0  at  time  n  +  5  and  that  it  should  be  delayed  for  five 
time  slots.  The  greedy  Fibonacci  representation  of  5  is 
01000.  The  representation  has  only  one  nonzero  element 
and,  because  kna XFibcrs  >  1,  5  is  an  eligible  backoff 
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Transmisson 


T 


0 

1 

2 

3 

4 

5 

6 

7 

8 
9 


X 

X 

X 

X 

X 

X 

X 

X 

1 

X 

can  be  altered  dramatically.  Since  Dmax.  grows  linearly  with 
Fm  and  Fm  is  an  exponentially  increasing  function  of  m, 
increasing  m  by  1  would  result  in  an  exponential  increase  in 
Dnua.  Therefore,  the  size  of  the  insertion  table  Dmaxn fibers 
can  grow  exponentially  with  n  fibers-  It  should  be  noted  that 
making  Dmaxn fiber*  large  constitutes  only  part  of  the 
solution.  The  other  part  is  to  ensure  that  most  of  the 
Dmaxn  fibers  entries  can  be  utilized.  This  requires  the  proper 
selection  of  n.  Because  the  Fibonacci  buffer  set  is  flexible,  a 
cell  needing  to  be  delayed  for  k  time  slots  has  the 
opportunity  to  be  delayed  for  exactly  k  time  slots  unless  it 
is  backed  off  to  a  larger  integer.  In  cases  where  backoffs  are 
rare,  most  cells  are  delayed  for  an  amount  that  is  close  to 
their  theoretical  minimum. 


Fig.  6.  State  of  T,  D,nin,  and  Transmission . 

candidate.  Because  T[ 0, 5]  is  unoccupied,  the  cell  may  be 
inserted  into  fiber  5  at  time  n.  The  algorithm  schedules  the 
cell  to  traverse  fiber  5  at  time  n  and  updates  the  data 
structures  Diuivf  T,  and  Transmission  accordingly. 

3.4  Salient  Characteristics  of  the  Architecture 

The  architecture  regards  the  Dmaxnfucr *  entries  of  the 
insertion  table  T  as  "resources";  hence,  the  time  dimension 
as  well  as  the  fibers  can  be  exploited  to  store  cells.  A  cell 
needing  to  be  buffered  is  considered  to  be  a  "consumer"  of 
the  resources.  The  efficacy  of  the  architecture  depends  on  its 
ability  1)  to  limit  the  number  of  entries  used  by  each  cell, 
2)  to  reduce  the  probability  of  time-correlated  cells 
contending  for  the  same  set  of  resources  at  about  the  same 
time,  and  3)  to  make  the  product  Dmuxnfners  large  while 
keeping  nfib(JTi!  small.  All  of  these  goals  must  be  achieved 
without  incurring  prohibitively  large  delays. 

The  Fibonacci  buffer  set,  in  conjunction  with  the  control 
algorithm,  can  satisfy  1),  2),  and  3)  without  relaxing  the 
latency  or  cell  sequence  integrity  requirements.  The  max¬ 
imum  number  of  nonzero  elements  in  the  greedy  Fibonacci 
representation  of  an  integer  between  1  and  Drnax,  inclusive, 
is  no  larger  than  and,  therefore,  a  cell  may  consume  at 
most  [tf|  entries.  Although  p£"|  is  already  a  rather  small 
number,  in  practice,  the  number  of  fibers  required  by  a  cell 
is  much  smaller  than  this  theoretical  maximum.  The 
contention  of  the  same  table  entries  at  about  the  same  time 
is  alleviated  by  permuting  the  fiber  sequences  to  be 
traversed  and  by  backing  off.  For  example,  the  two 
permutations  hjkn-i'"hl  and  4;  h2 ' ' '  of  a  fiber  se¬ 
quence  on  Lkl.Lk2----,Lk„  correspond  to  two  completely 
different  sets  of  entries  in  the  insertion  table.  As  the  number 
n  of  fibers  in  a  fiber  sequence  increases,  the  number  of 
permutations  increases  exponentially,  i.e.,  n\.  This  offers 
massive  redundancy  when  finding  a  corresponding  set  of  n 
unused  entries  is  difficult.  The  backoff  algorithm  alters  the 
set  of  resources  required  by  an  arriving  cell. 

The  Fibonacci  buffer  set  has  the  remarkable  property  that 
the  greedy  representations  of  adjacent  integers  k  and  k  +  1 
look  very  different  for  a  wide  range  of  ks.  For  example,  the 
greedy  representations  of  1  to  10  on  fiber  lengths  1,  2,  3,  5, 
and  8  are  00001,  00010,  00100,  00101,  01000,  01001,  01010, 
10000, 10001,  and  10010,  respectively.  By  backing  off  a  delay 
of  k  to  k  4-  1,  for  example,  the  resources  required  by  a  cell 


3.5  Time  and  Space  Complexity 

The  actual  time  complexity  of  the  control  algorithm 
depends  on  the  tuning  parameters.  We  found  that  the 
architecture  is  most  robust  when 

k Max  Fibers  5 

kaigorithm  ~  FackO  f  /Only , 

k  Max  Perm  =:  2, 

and 

kMuxBuckOff  ~  Dmax  1. 

The  complexity  of  the  control  algorithm,  measured  in  terms 
of  the  switch  parameters  Dmax/  N,  M,  and  s, 
corresponding  to  this  set  of  tuning  parameters  is  as  follows: 

1 .  Space  Complexity.  The  space  complexity  of  the  control 
algorithm  refers  to  the  memory  requirement  for  the 
data  structures  Dminr  T,  and  Transmission.  Because 
T  requires  the  most  memory,  the  space  complexity 
of  the  algorithm  is  0(D  max  n fibers)  bits. 

2.  Time  Complexity.  Because  a  fiber  sequence  may 
contain  up  to  p|l  fibers  and  m  =  0{logdDmax)r  the 
control  algorithm  requires  Oilog^Dmax)  steps  to 
determine  whether  there  exists  a  fiber  sequence  of 
length  k.  Since  there  are  up  to  Dmax  ks  to  consider, 
the  time  complexity  of  the  algorithm  per  cell  is 
0(pmaxlog0Dmax).  In  a  lossless  architecture,  how¬ 
ever,  the  expected  time  complexity  is  approximately 
0(log<t>Dmax),  as  the  expected  number  of  backoffs  is 
small.  The  parameter  Dma: T  (the  maximum  latency 
that  a  cell  may  be  buffered  in  the  switch)  is,  in 
practice,  a  small  constant  because  an  ATM  switch 
typically  has  a  very  stringent  latency  requirement. 
Therefore,  the  control  algorithm  has  low  expected 
time  complexity  in  such  an  architecture  and  can  be 
executed  in  real  time. 

4  Performance  of  the  Architecture 

We  evaluated  the  performance  of  the  architecture  under  a 
diverse  set  of  time-correlated  and  heavily  loaded  traffic 
profiles  through  simulation  studies.  For  each  traffic  profile, 
we  obtained  the  buffer  set  (ni.no. ...  }nm)  that  resulted  in 
an  empirically  lossless  architecture.  An  ATM  switch  is 
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empirically  lossless  if  no  cell  loss  can  be  detected  during  a 
simulation  run  of  0(1O7)  time  slots.  Hie  model  for  the 
performance  evaluation  is  described  in  Section  3.1.  To 
simplify  the  evaluation,  we  set  jV  =  A/  =  2,  as  our  goal  is  to 
design  a  buffer  organization  for  a  small-dimension  ATM 
switching  element,  with  symmetric  traffic  profiles,  i.e.,  the 
input  process  of  each  input  channel  is  the  same  and 
destinations  of  arriving  cells  are  uniformly  distributed 
among  the  M  output  channels.  This  corresponds  to  setting 
Qij  =JI  for  0  <  ij  <  1  and  a*.  =  a0-  for  k  =  0, 1  and 

We  define  X  and  X Bernoulli  to  be  the  number  of 
consecutive  time  slots  in  which  the  2-state  MMBP  of  an 
input  channel  and  a  Bernoulli  process,  respectively,  would 
generate  a  cell  in  each  time  slot.  The  time  correlation  index  r 
of  the  2-state  MMBP  is  defined  as  r  =  r(-^(X)  -  - ;  therefore, 
the  Bernoulli  process  in  which  cells  are  generated  indepen¬ 
dently  of  each  other  has  a  time  correlation  index  of  1.  An 
input  process  in  which  cell  arrivals  are  positively  time- 
correlated  has  r  >  1,  where  a  larger  value  of  r  indicates 
more  time  correlation.  Because  we  assume  that,  for 
symmetry,  destinations  of  cells  are  uniformly  distributed 
among  the  output  channels,  the  correlation  of  the  destina¬ 
tions  of  cells  is  not  considered.  The  offered  load  p  and  the 
time  correlation  index  r  of  a  2-state  MMBP  uniquely 
determine  the  MMBP's  state  transition  probabilities  for 
1  <  hj  <  2  as  follows: 


T-(l-p) 

(i) 

;  aio  —  1  —  an 

T 

T  -  p 

(2) 

—  .  Cto\  =  1  —  CX  oo. 

T 

We  evaluated  the  performance  of  the  switch  under  traffic 
profiles  with  p  =  0.6. 0.7,  and  0.8,  and  r  =  1,2, 4, 8, 16,  and 
32  through  simulation  studies.  The  duration  of  each 
simulation  run  was  107  time  slots.  We  anticipated  that  the 
sampling  error  of  the  simulation  studies  would  be  con¬ 
siderable  at  the  region  where  the  cell  loss  rate  is  extremely 
small,  e.g.,  less  than  10~D.  Because  the  number  of  cells 
generated  for  each  run  is  on  the  order  of  107,  a  cell  loss 
value  that  is  less  than  10~6  should  be  construed  as 
simulation  noise.  Although  such  a  cell  loss  value  cannot 
be  regarded  as  statistically  valid,  it  provides  the  valuable 
indication  that  cell  loss  is  rare  in  the  switch  architecture. 

The  sample  distribution  of  dm?n/  which  is  the  minimum 
time  that  an  arriving  cell  needs  to  be  delayed,  was  also 
collected.  We  used  the  sample  distribution  to  estimate  the 
latency  experienced  by  each  arriving  cell.  For  various  buffer 
sets,  latencies  Dmax  and  traffic  profiles,  the  cell  loss  rates  are 
shown  in  Figs.  7, 8, 9, 10, 1 1,  and  12.  Because  a  cell  loss  rate  of  0 
corresponds  to  -oc  on  the  y-axis,  we  substituted  10“7  as  the 
value  for  0  in  Figs.  7,  8,  9,  10,  11,  and  12.  The  sample 
distributions  of  dmm  for  cases  where  p  -  0.8  and  r  = 
L  2, 4, 8, 16, 32  are  shown  in  Figs.  13, 14, 15, 16, 17,  and  18.  In 
each  case,  the  corresponding  empirically  lossless  buffer  set 
tabulated  in  Table  1  in  Section  4  is  used.  The  relative 
frequency  at  latency  k  is  the  percentage  of  arriving  cells  that 
find  dmin  =  k.  Because  a  relative  frequency  of  0  corresponds  to 


Fig.  7.  Cell  loss  rates  for  p  =  0.6. 0.7. 0.8  and  r  =  1. 

-oo  on  the  y-axis,  we  substituted  10“8  for  a  relative  frequency 
of  0  in  the  figures.  Note  the  exponential  decay  of  the  relative 
frequencies  in  all  of  the  figures.  For  example,  in  Fig.  13,  the 
relative  frequency  for  dmin  =  2  is  on  the  order  of  10'1,  but  the 
relative  frequency  for  dwirt  =  5  is  on  the  order  of  10"3,  which  is 
a  decrease  in  the  relative  frequency  by  two  orders  of 
magnitude  as  dmiri  increases  from  2  to  5.  The  fact  that  all  of 
the  sample  distributions  of  dWin  exhibit  this  behavior 
indicates  that  the  latency  mass  function  of  the  Fibonacci 
switch  decreases  very  rapidly  as  the  latency  increases. 

4.1  Tunable  Parameters 

After  examining  the  impact  of  the  tuning  parameters  on  the 
architecture  through  several  simulation  studies,  we  found 
that  the  backoff  mechanism,  due  to  the  cell  sequence 
integrity  constraint,  can  have  a  deleterious  effect  on  the 
performance  of  the  switch.  Consequently,  slowing  the 
backoff,  i.e.,  backing  off  to  a  small  integer  rather  than  to  a 
large  integer,  is  crucial.  This  corresponds  to  setting 
k  Max  Fibers  =  Rrl-  We  also  found  that,  as  long  as  each  backoff 
is  slow,  allowing  a  large  number  of  backoffs  is  preferable, 
i.e.,  setting  kMaxB(1ckOf}  —  Dmax  -  1.  We  also  conducted 
studies  to  examine  whether  the  performance  of  the  switch 
would  improve  by  searching  all  possible  permutations  of  a 
fiber  sequence  and  found  that,  in  many  cases,  searching  for 
more  permutations  does  not  significantly  improve  the  cell 
loss  performance  of  the  architecture  (see  Appendix  F). 


Fig.  8.  Cell  loss  rates  for  p  =  0.6. 0.7. 0.8  and  r  =  2. 
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Fig.  9.  Cell  loss  rates  for  p  -  0.6. 0.7. 0.8  and  r  =  4. 

Empirically,  searching  for  only  two  permutations  of  a  fiber 
sequence  on  L*, ,  Z*2 . . . . ,  Lk„  suffices  to  achieve  good 
performance.  In  particular,  assuming  that 
kTI  >  kj,- 1  >  •  ♦  •  >  k\,  the  control  algorithm  examines  only 
the  two  permutations  hjk„^  *  *  *  4t  and  Z*,  h-2  ■  ■  •  hu  • 

4.2  Heuristic  for  an  Empirically  Lossless  Switch 
Architecture 

A  buffer  set  that  results  in  an  empirically  lossless  switch 
architecture  is  referred  to  as  an  empirically  lossless  buffer 
set .  Because  a  small  value  of  Dmax  would  require  a  large 
value  of  rifibers  to  keep  D-puix'^  fibers  lsrge,  we  vary  Drnax  as 
if  it  were  a  design  parameter  to  determine  its  effect  on 
the  architecture.  Our  goal  is  to  search  for  the  largest 
acceptable  Dmax  so  that  the  number  of  fibers  used  in  an 
empirically  lossless  buffer  set  is  as  small  as  possible.  A 
buffer  set  n  =  (ni,n2, . . .  ,n„,)  can  be  augmented  by  one 
fiber  in  two  ways:  n  becomes  (ni.m, . . .  .nw,  1)  or  n 
becomes  (ni,n2, . . .  ,nm)  4-  6jm,  where  1  <  j  <  m  and  is 
an  m-vector  with  1  in  the  ;th  entry  and  zero  otherwise.  That 
is,  a  buffer  set  can  be  augmented  by  adding  a  fiber  of  a  new 
fiber  type  or  a  fiber  of  an  existing  fiber  type.  Adding  a  fiber 
type  of  length  Fm+o  requires  Dmax  to  be  modified  so  that 

-^7/1+2  ^  Dmax* 

The  heuristic  that  we  used  to  obtain  the  empirically 
lossless  buffer  sets  under  various  traffic  scenarios  is  as 
follows:  Initially,  we  set  Dmax  =  1,  m  =  1,  and  n  =  (1).  We 


Fig.  10.  Cell  loss  rates  for  p  =  0.6. 0.7. 0.8  and  r  =  8. 
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Fig.  1 1 .  Cell  loss  rates  for  p  —  0.6. 0.7, 0.8  and  r  =  16. 

then  conducted  a  simulation  study  to  determine  whether 
n  =  (1)  is  an  empirically  lossless  buffer  set.  If  not,  we  set 
Dmax  «—  2 Dn,ax  and  m  <—  i  such  that  Fi+ 1  is  the  largest 
Fibonacci  number  that  is  no  larger  than  the  new  Dmax «  The 
number  of  fibers  for  each  new  fiber  type  introduced  is  1.  For 
example,  the  design  parameters  Dmax  =  1,  m  =  1,  and  n  = 
(1)  become  Dmax  =  2,  m  —  2,  and  n  =  (l,l).  We  then 
conducted  another  simulation  run  to  see  whether  the 
augmented  buffer  set  (1, 1)  results  in  an  empirically  lossless 
architecture.  The  process  of  extending  the  buffer  set  by 
adding  a  new  fiber  type  continues  until  a  new  buffer  set 
yields  a  higher  cell  loss  rate  than  the  previous  buffer  set.  We 
then  regress  to  the  previous  buffer  set  (assuming  that  it 
consists  of  rri  fiber  types)  and  augment  it  by  adding  a  fiber 
of  length  1.  Then,  we  add  a  fiber  of  length  2  if  the  new 
buffer  set  is  not  empirically  lossless  and  so  on  until  =  nj 
for  all  iff  between  1  and  m'.  The  buffer  set,  if  needed  to  be 
extended,  is  augmented  by  adding  a  new  fiber  type  of 
length  Fm‘+i  and  the  process  repeats. 

The  empirically  lossless  buffer  sets  obtained  by  the 
heuristic  for  the  cases  where  p  —  0.8  and  r  = 
1,2,4.8,16,32  are  shown  in  Table  1.  The  corresponding 
Dmux  and  the  total  number  n fibers  of  fibers  in  each  buffer 
set  are  also  tabulated.  For  example,  for  the  case  where 
r  —  32,  the  empirically  lossless  buffer  set,  the  maximum 
latency  Dmax/  and  the  total  number  of  fibers  used  are 
n=  (2. 2. 2, 2, 2, 2. 1,1,1),  64,  and  15,  respectively,  n  — 


Fig.  12.  Cell  loss  rates  for  p  —  0.6. 0.7. 0.8  and  r  =  32. 
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Fig.  13.  Sample  distribution  of  d,„n}  for  p  =  0.8  and  7  =  1.  Fig.  15.  Sample  distribution  of  dmin  for  p  =  0.8  and  r  =  4. 


(2, 2. 2. 2. 2. 2, 1,1,1)  indicates  that  there  are  two  fibers  of 
each  fiber  type  of  lengths  1 ,  2,  3,  5,  8,  and  13  and  one 
fiber  of  each  fiber  type  of  lengths  21,  34,  and  55. 

4.3  Feasibility  of  an  Empirically  Lossless  Switch 
Architecture 

We  now  discuss  the  feasibility  of  implementing  an 
empirically  lossless  switch  architecture  under  the  most 
bursty  traffic  profile,  i.e.,  r  =  32  and  p  =  0.8,  that  we  have 
examined.  The  expected  number  of  consecutive  cell  arrivals 
on  an  input  channel  is  160.  Although  the  expected  burst 
length  on  an  input  to  an  ATM  switch  can  be  arbitrarily 
large,  we  assume  that  ATM  traffic  management  mechan¬ 
isms,  such  as  call  admission  control,  routing,  and  policing, 
of  the  network  would  regulate  traffic  so  that  the  expected 
burst  length  on  an  input  channel  exceeding  a  threshold  as 
large  as  160  is  unlikely.  The  empirically  lossless  buffer  set 
for  the  traffic  profile  consists  of  15  fibers.  This  suggests  that 
the  corresponding  switching  fabric  must  be  of  dimensions 
17  x  17. 

Of  the  possible  optical  switching  fabrics,  the  classical 
crossbar  is  perhaps  the  most  desirable  because  of  its  control 
and  design  simplicity,  but,  since  the  number  of  crosspoints 
required  grows  quadratically  with  the  numbers  of  inputs 
and  outputs,  a  concern  is  that  it  may  be  impossible  or 
impractical  to  implement  a  17  x  17  optical  crossbar.  It  has 
been  found  that  optical  crossbars  as  large  as  32  x  32  are 


feasible  [1];  therefore,  implementing  the  empirically  lossless 
buffer  set  using  an  optical  crossbar  is  well  within  the  limits 
achievable  by  today's  technology.  It  is  difficult  to  build  a 
large  optical  crossbar  so  that  all  of  its  crosspoints  function 
properly;  consequently,  the  fabrication  yield  of  large  optical 
crossbars  with  perfect  crosspoints  is  typically  low.  Because 
of  the  greedy  Fibonacci  representation,  a  cell  cannot 
traverse  fibers  of  length  Fr  and  Fi+X  for  any  i  where  2  < 
i  <  m  +  1  and,  therefore,  a  fiber  of  length  F+  need  not  be 
connected  to  a  fiber  of  length  Fi+1  and  vice  versa.  This 
means  that  an  imperfect  optical  crossbar  can  be  used  as  the 
switching  fabric  of  our  architecture  and  that  the  yield  of 
constructing  the  fabric  would  be  high.  The  ability  of  our 
architecture  to  function  with  imperfect  switching  fabrics 
significantly  enhances  its  viability. 

4.4  Comparison  of  Fibonacci  and  2-Exponential 
Buffer  Sets 

We  now  show  why  the  Fibonacci  buffer  set  performs  better 
than  the  2-exponential  buffer  set,  even  though  both  are 
flexible  and  have  exponentially  increasing  fiber  lengths.  The 
key  observation  is  that  the  more  effective  buffer  set  should, 
on  average,  require  fewer  fibers  to  store  an  ATM  cell  and 
should  have  fewer  Is  in  the  cell's  binary  representation. 

We  show  that  the  Fibonacci  buffer  set  requires  fewer  Is 
to  represent  a  set  of  integers  between  1  and  n,  where  n  is 
very  large,  thereby  demonstrating  that  it  is  more  efficient. 


Fig.  14.  Sample  distribution  of  dmin  for  p  -  0.8  and  r  =  2. 


Fig.  16.  Sample  distribution  of  dmin  for  p  -  0.8  and  r  =  8. 


476 


Latency 


Fig.  17.  Sample  distribution  of  dmin  for  p  —  0.8  and  r  —  16. 

Let  ki  and  ho  be  the  largest  integers  such  that  2k 1  <  n  and 
Fk.,+i  <  n,  respectively.  Any  integer  x  between  1  and  n  can 
be  represented  as  x  =  Y.%o  anc* x  =  Sjii  where 

xpj,  0j  €  {0, 1}.  We  wish  to  determine  whether  there  exists  an 
asymptotic  relationship  between  k\  and  ko,  as  n  becomes 
very  large.  Since  n<  2k'—l,  ki  =  \logo(n  4-  1)].  As  n 
becomes  large,  k\  w  logon;  similarly,  since  ■^0/C2+1  <  n, 
k%  «  From 

I  7  /Offlpn 

«1  ^  1092™  ^  login? 

k2  ~  logJy/Bn)  ~  io«i.,(V5n) ; 

it  follows  that: 

/c2  %102  0.301029995 

This  implies  that  ko «  1.44fci,  as  n  becomes  large.  If 

1. 2. 4. 8. ,  2frl ,  i.e.,  the  2-exponential  buffer  set,  were  used 
to  represent  a  fiber  sequence  of  length  x,  where  x  is 
uniformly  distributed  between  1  and  n,  then  the  expected 
number  of  Is  in  the  binary  representation  of  a;  is  0.5fci.  If 

1. 2. 3. 5. 8. . .. ,  Fk2+\ ,  i.e.,  the  Fibonacci  buffer  set,  were  used 
to  represent  x,  the  expected  number  of  Is  in  the  binary 
representation  of  x  is  0.2763 ko,  as  0.2763  is  the  expected 
number  of  Is  in  a  digit  of  a  greedy  Fibonacci  representation. 
Although  k2  «  1.44/ci,  (0.2763)(1.44)fci  «  0.4fci;  hence,  the 
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Fig.  18.  Sample  distribution  of  dmin  for  p  =  0.8  and  r  =  32. 
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TABLE  1 

Empirically  Lossless  Buffer  Sets  for  Cases 
where  p  =  0.8  and  r  =  1 , 2, 4, 8, 16, 32 


Fibonacci  representation  requires  only  0.4&i  Is  to  represent 
an  integer,  whereas  the  2-exponential  representation  re¬ 
quires  0.5&1. 

We  performed  two  simulation  studies  to  compare  the 
efficacy  of  the  two  buffer  sets;  the  results  validated  the 
conclusion  of  the  mathematical  comparison.  In  one  study, 
the  Fibonacci  buffer  set  was  used;  in  the  other,  the  2- 
exponential  buffer  set  was  used.  The  switch  parameters  for 
both  studies  were  M  =  N  =  2,  r  =  8,  and  p  =  0.8.  The 
results  of  the  two  studies,  which  demonstrate  the  super¬ 
iority  of  the  Fibonacci  buffer  set,  are  shown  in  Fig.  19.  In  the 
figure,  the  performance  of  the  two  buffer  sets  is  comparable 
when  the  number  of  fibers  is  five  or  fewer.  When  the 
number  of  fibers  is  increased  from  five  to  six  and  from  six  to 
seven,  the  performance  of  the  2-exponential  buffer  set 
worsens,  while  the  performance  of  the  Fibonacci  buffer  set 
continues  to  improve.  In  particular,  when  the  number  of 
fibers  is  seven,  the  cell  loss  rate  effected  by  the 
2-exponential  buffer  set  is  10”1,  three  orders  of  magnitude 
larger  than  that  achieved  by  the  Fibonacci  buffer  set. 


Number  of  Fibers  Used 


Fig.  19.  Cell  loss  rates  achieved  by  the  Fibonacci  and  the  2-exponential 
buffer  sets  for  the  case  where  p  -  0.8  and  r  =  8. 
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Fig.  20.  The  number  of  fibers  required  in  the  empirically  lossless  buffer 
set  for  p  =  0.8  and  for  various  values  of  the  time  correlation  index  r. 


4.5  Performance  Characteristics 

The  architecture  exhibits  the  following  performance 
characteristics: 


1 .  The  cell  loss  rate  decreases  exponentially  as  the  number  of 
fibers  increases.  The  cell  loss  performance  curves  in 
Figs.  7, 8, 9, 10, 11,  and  12  indicate  that  an  increase  of 
one  fiber  results  in  approximately  an  order  of 
magnitude  decrease  in  the  cell  loss  rate  of  the 
architecture.  For  example,  for  .the  case  where  p  =  0.8 
and  r  =  32,  the  cell  loss  rate  corresponding  to  the 
buffer  set  n  =  (2, 2, 2, 2,  lfl,  1, 1, 1)  is  on  the  order  of 
10  J,  but,  when  the  buffer  set  is  augmented  to 
(2.2,2, 2,2, 1, 1. 1. 1),  the  cell  loss  rate  decreases  to 
the  order  of  10~G. 

2.  The  latency  mass  function  decays  rapidly  as  the  latency 
increases.  The  mode  of  dmi„  is  between  0  and  1,  and 
the  relative  frequencies  decay  rapidly  as  the  delay 
increases,  e.g.,  the  percentage  of  cells  experiencing  a 
minimum  delay  of  k  time  slots  is  approximately  an 
order  of  magnitude  larger  than  the  percentage  of 
cells  that  are  delayed  for  k+  1  time  slots.  This 
pattern,  which  is  present  in  all  of  the  simulation 
studies,  indicates  that  most  cells  are  not  artificially 
delayed  by  the  architecture  for  the  purpose  of 
reducing  cell  loss. 

3.  The  deleterious  effect  of  an  exponential  increase  in  time 
correlation  can  be  mitigated  by  a  linear  increase  in  the 
number  of  fibers.  The  simulation  results  indicate  that 
the  effect  of  an  exponential  increase  in  r  can  be  offset 
by  the  addition  of  a  few  fibers,  as  shown  in  Fig.  20. 
For  example,  for  the  case  where  p  =  0.8  and  r  =  1, 
the  empirically  lossless  buffer  set  consists  of  seven 
fibers.  For  the  case  where  p  =  0.8  and  r  =  32,  the 
buffer  set  corresponding  to  a  lossless  architecture 
requires  15  fibers.  The  effect  of  a  25-fold  increase  in 
time  correlation  is  negated  by  the  addition  of  only 
eight  fibers. 


5  Related  Work 

There  are  two  general  approaches  to  designing  a  packet 
switch  using  optical  delay  fibers  as  buffers:  deflection  and 
nondeflection.  The  driving  principle  behind  the  deflection 
approach,  which  is  used  in  [2],  [3],  [4],  is  to  distribute  the 
packets  so  that  the  traffic  arriving  at  each  switching  element 
is  sufficiently  smooth  or  noncorrelated.  The  result  is  that  a 
small  number  of  buffers  within  a  switching  element, 
coupled  with  the  deflection  of  packets  to  noncongested 
outputs,  suffices  to  achieve  a  low  packet  loss  rate.  A 
significant  drawback  of  this  approach,  which  limits  its 
applicability  to  ATM  networks,  is  that  packets  may  be 
delivered  out  of  order  and  that  latency  may  be  high. 

The  approaches  proposed  by  Varvarigos  in  [14]  can 
alleviate  many  of  the  deleterious  effects,  e.g.,  out-of-order 
packet  delivery  and  high  latency,  of  the  deflection  ap¬ 
proach.  Although  those  approaches  can  guarantee  lossless 
communication  and  use  only  a  few  fibers,  they  require  the 
flow  control  strategy  of  [6],  which  ensures  that  the  traffic 
satisfies  a  certain  smoothness  constraint.  The  effectiveness 
of  the  flow  control  strategy  depends  on  the  selection  of  an 
appropriate  frame  size  F.  A  large  value  of  F  may  result  in 
high  latency  and  may  require  the  use  of  a  large  number  of 
fibers  to  achieve  lossless  communication.  On  the  other 
hand,  the  burstiness  of  traffic  that  can  be  sent  over  a 
connection  is  proportional  to  F.  To  accommodate  traffic 
with  a  high  degree  of  burstiness  or  time  correlation,  i.e., 
traffic  typically  found  in  today's  ATM  networks,  a  large 
value  of  F  must  be  used,  but  this  results  in  high  latency  and 
low  throughput.  The  approaches  proposed  by  Varvarigos, 
therefore,  inherit  some  of  the  disadvantages  of  the  flow 
control  strategy. 

In  the  nondeflection  approach,  which  is  the  focus  of  this 
paper,  the  emphasis  is  on  providing  sufficient  buffering  at 
each  switching  element  so  that  cell  loss  is  kept  below  an 
acceptable  threshold.  Among  the  research  studies  that  use 
the  nondeflection  approach,  the  closest  in  spirit  to  this 
paper  are  [5],  [7]. 

In  [5],  Cruz  and  Tsai  propose  a  2  x  2  optical  packet 
switch.  The  "baseline"  switch  consists  of  n  stages  of  2-state 
switching  elements,  with  each  switching  element  having  an 
optical  delay  fiber  of  equal  length  A  to  buffer  excess 
packets.  Cruz  and  Tsai  show  that,  with  a  sufficiently  large 
A  and  noncorrelated  traffic  at  the  inputs,  the  baseline 
switch  can  achieve  a  low  cell  loss  rate  while  using  only  a 
few  stages  or  fibers.  When  the  traffic  on  the  input  channels 
is  correlated,  the  analysis  is  no  longer  applicable  and  the 
performance  of  the  baseline  switch  is  essentially  unknown. 

An  "augmented"  switch  is  obtained  by  inserting  additional 
optical  delay  fibers  between  adjacent  stages  in  the  baseline 
switch.  Those  fibers  must  be  large  and  of  relatively  prime 
lengths;  however,  only  a  few  fibers  suffices  to  achieve  a  low 
cell  loss  rate. 

While  Cruz  and  Tsai  focus  on  the  design  of  a  2x2 
switch,  Haas  considers  the  construction  of  an  arbitrary 
dimension  packet  switch  in  [7],  The  switching  fabric  of  the 
architecture  consists  of  k  optical  delay  fibers  of  unequal 
lengths.  An  arriving  packet  is  inserted  into  one  of  the  fibers 
such  that  there  is  no  transmission  collision  at  any  of  the 
output  channels  and  the  packet  sequence  integrity  is 
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maintained.  Packets  that  cannot  be  accommodated  are 
discarded. 

There  are  important  differences  between  our  switch  and 
the  switches  considered  in  [5],  [7].  While  the  baseline  switch 
of  [5]  can  achieve  a  good  cell  loss  performance  with  only  a 
few  fibers,  the  latency  is  nA,  which  may  be  excessive  for  an 
ATM  switch.  Furthermore,  good  performance  can  be 
achieved  only  if  the  input  traffic  is  noncorrelated.  While 
the  noncorrelated  traffic  class  may  be  an  adequate  approx¬ 
imation  for  the  traffic  in  a  very  high-speed  future  ATM 
network,  in  today's  environment,  ATM  traffic  tends  to  be 
heavily  correlated.  If  the  augmented  switch  of  [5]  were  used 
to  accommodate  time-correlated  traffic,  the  packet  sequence 
integrity  would  not  be  maintained  and  the  latency  would  be 
prohibitively  high,  e.g.,  on  the  order  of  thousands  of  time 
slots;  consequently,  neither  the  baseline  switch  nor  the 
augmented  switch  is  suitable  for  ATM  applications.  In 
comparison,  our  design  yields  a  low  cell  loss  rate  even 
under  highly  time-correlated  traffic  and  results  in  a  much 
lower  latency.  While  our  control  algorithm  is  more 
complex,  its  expected  time  complexity  is  0{logDrnax),  which 
is  suitable  for  high-speed  operation. 

The  pioneering  design  of  [7]  provides  a  framework  for 
building  an  optical  packet  switch,  but  the  issue  of  how  to 
resolve  the  trade-off  between  using  few  fibers  and  achiev¬ 
ing  a  low  loss  rate  has  not  been  adequately  addressed. 
Indeed,  it  appears  that  the  design  would  require  a  large 
number  of  fibers  to  achieve  a  sufficiently  low  loss  rate 
acceptable  to  ATM  applications.  For  example,  consider  the 
case  where  the  number  of  input  and  output  channels  is  4, 
the  fiber  lengths  are  5,  10,  15,  and  20,  the  offered  load  on 
each  input  is  0.8,  and  the  expected  burst  length  is  10,  which 
roughly  corresponds  to  r  =  2.  With  four  fibers,  the 
corresponding  loss  rate  is  on  the  order  of  10-1  for  the 
architecture  of  [7].  In  contrast,  for  the  similar  case  where 
p  =  0.8  and  r  =  2,  and  where  the  expected  burst  length  is 
also  10,  our  design  requires  only  seven  fibers  to  achieve  an 
empirically  lossless  architecture.  Because  each  simulation 
run  generates  on  the  order  of  107  cells,  an  empirical  zero  cell 
rate  suggests  that  the  actual  cell  loss  rate  effected  by  the 
Fibonacci  switch  is  smaller  than  0(10“°).  Hence,  the  use  of 
three  additional  fibers  by  the  Fibonacci  switch  has  resulted 
in  a  reduction  of  at  least  four  orders  of  magnitude  in  the  cell 
loss  rate. 

6  Conclusion 

An  ATM  switch  architecture  using  optical  delay  fibers  as 
buffers  has  been  presented.  Because  of  the  remarkable 
properties  of  the  Fibonacci  numbers,  the  architecture  is 
computationally  efficient,  requires  few  fibers,  and  yields 
low  latency  and  cell  loss  rate  across  diverse  time-correlated 
and  heavily  loaded  traffic  profiles.  The  buffer  sets  corre¬ 
sponding  to  an  empirically  lossless  2x2  ATM  switch 
architecture  under  diverse  traffic  scenarios  have  been 
presented.  The  architecture  exhibits  the  desirable  character¬ 
istics  that  the  cell  loss  rate  decreases  exponentially  as  the 
number  of  fibers  increases,  that  the  probability  mass 
function  for  the  sample  latency  decays  rapidly,  and  that 
an  exponential  increase  in  the  time  correlation  of  traffic  can 
be  offset  by  a  small  increase  in  the  number  of  fibers. 


Because  of  these  salient  characteristics,  this  architecture 
may  play  an  integral  role  in  future  optical  ATM  networks. 


Appendix  A 

Index  of  Largest  Fibonacci  Number  <  Dmax 


We  wish  to  determine  the  number  m  of  fiber  types  required 
by  a  Fibonacci  buffer  set  to  represent  any  integer  between  1 
and  Dmax,  inclusive.  Since  m  +  1  is  the  index  of  the  largest 
Fibonacci  number  less  than  or  equal  to  D  max /  i-Cv  Fm+l  ^ 
Dmax  and  Fm+2  >  Dmax/  the  index  m  +  1  of  the  largest 
Fibonacci  number  Fm+ i  <  Dmax  may  be  computed  as 
follows: *  From  [13],  FTI  =  ^4>n  -  where  4>  =  ^  (1  + 
\/5)  and  <£  =  ^(1  —  \/5).  Since  (I^I)ri|  <  I, 


1 

ivs*  +i 


Then,  Fm+i  =  [jr6T"+l  +§J.  Since  +  ±J  <  D„ 

+\<  Dm ox+1 

4>m+1  <  V5  (Dmar  +  l 


m 


+  1  ^  l09d>  ^D-max  “1“  2^  ^ 


m  + 1  = 


l09d  ( ^ max  H”  0 


Since  log^x  =  the  number  of  distinct  fiber  types  m 
required  to  represent  an  integer  in  [1  ,Dmax)  is 


^Pio(  v^5(Fmaa-  -b  2)) 
logio4> 


-1. 


Appendix  B 

Number  of  Length-h  Number  Sequences  on 
k  Fiber  Types 

We  wish  to  determine  the  number  of  ways  to  construct  a 
fiber  sequence  of  length  n  using  fibers  of  lengths  L\, 
£2,..., I*/  without  considering  the  ordering  of  the  fibers 
within  a  sequence.  Let  TLl{z)  he  the  generating  function  of 
the  number  of  ways  to  construct  a  sequence  of  length  n 
using  only  lengths  Li,  . .  - , Then, 

Txj  (z)  =  1  +  zLl  +  2?Li  4- . . . 

Tl,., (z)  =  (1  +  /-■  +  rL‘-'  +  . .  .)TLi(z). 


This  implies  that: 


TlM 


1 


The  number  of  length-n  fiber  sequences  on  Li, L2,  •  •  ■  >  -L*  is 
the  coefficient  of  the  zn  term  of  Tik{z).  Although  we  could 
not  obtain  a  closed-form  expression  of  this  coefficient,  the 
form  of  TLk  (z)  suggests  that  it  is  an  exponential  function  of 
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n,  k,  and  L\ ,  £2-  *  * ,  Lk  and  that  it  is  impractical  to  examine 
all  possible  length-?!  fiber  sequences. 


Appendix  C 

Greedy  Fibonacci  Representation 
of  an  Integer  n 

We  prove  by  induction  that  a  greedy  Fibonacci  representa¬ 
tion  of  any  positive  integer  n  exists.  If  n  =  1,  then  n  =  F2; 
therefore,  the  basis  of  the  induction  holds.  Suppose  that 
there  exists  a  greedy  representation  for  any  integer  n  such 
that  n  <  n.  Let  n  >  2  and  Fk  be  the  largest  Fibonacci  number 
less  than  or  equal  to  n,  i.e.,  Fk  <  n  and  FM  >  n.  Thus,  n  - 
Fk  <  Fk-i  since,  otherwise,  Fk- 1  +  Fk  =  FM  <  n.  This 
proves  that  F*_i  cannot  be  included  and  that  no  adjacent 
Is  are  possible  in  the  greedy  representation.  Finally, 
n  =  Fk  +  (n  -  Fk).  By  the  induction  hypothesis,  there  exists 
a  greedy  Fibonacci  representation  of  n  -  Fk;  thus,  there 
exists  a  greedy  Fibonacci  representation  of  n  which  is  the 
concatenation  of  Fk  and  the  greedy  representation  of  n  —  Fk. 


Appendix  D 

Self-Convolution  of  the  Fibonacci  Sequence 

Our  goal  is  to  determine  Fv^kFk/  which  is  required  for 
the  derivation  of  the  salient  properties  of  the  Fibonacci 
numbers  given  in  Appendix  E.  F„-kFk  is  the  coeffi¬ 
cient  of  the  ?ith  term  of  F2(z ),  where  F(z)  is  the  generating 
function  of  the  Fibonacci  sequence  and,  from  [13], 
F(z)  =  yzfrr?  •  Performing  a  partial  fraction  expansion  on 

F(z)  =  ,  we  determine  that  F(z)  =-yr(-r1 - M. 

Then, 


F-(z)  =-(— - Lr-) 

w  5  \i  -  4>Z  1  -4>z) 


F2U)=l 


5\(1~^)2  {1  -  <t>z)(l  -  <t>z)  (1  -4>zfJ' 

“><*  uip  -  y + i)#v, 

= |  (E  O' +  l^zi  - 2  E  + 1)^')  • 

\j=0  j= 0  j= 0  ) 


Since  =  1  +  <pz  +  4?z2  +  •  •  •  +  f‘z"  +  ■■■,  the  coeffi¬ 
cient  of  the  zn  term  of  is  d>n.  Similarly,  the  Hth 

coefficient  of  — ^  is  cV .  This  implies  that  <j>r'  +  4>n  is  equal 
to  the  coefficient  of  the  z"  term  of ,,  .  h - — .  Thus,  since 

4>  +  d>  =  1  and  <p<t>  =  —1, 


1  |  1 _  2  -{<j>  +  $)z  _  2-z 

(1  ~4>z)  (1  -  4>z)  (1  -<j>z)(\  -4>z)  ~  1  -z-z2 

2-z  2  z 

l-z-z2~l- z-z2  1- z-z2' 

The  coefficient  of  the  zn  term  of  jzirzi  can  be  determined  as 
follows: 


(\  -  4>z)(\  -  <f>z) 


F(z) 


Since 

F(z) 


T2  =  Fez'1  +  FlZ°  +  F2z1  +  ---  +  Fnzn~l  +  F„+1zn  +  ■■■, 


the  coefficient  of  the  zn  term  of 


is  2F„+i.  The 


generating  function  of  the  Fibonacci  sequence  is  ^ and 
Fn  is  the  coefficient  of  the  zn  term.  Thus,  the  coefficient  of 
the  zn  term  of  .  2~~  ■>  is 


2  Fr 


n+ 1 


■Fri. 


This  implies  that  <j>T'  +  <j>n  =  2 FT 


n+1 


££o  U  +  1)^  +  1)4? zj, 


F„.  Since  — 1-;,  = 

(1-0-) 


r  4-  - 


(l-(j>z)2  (1  -4>Z)2  jrS' 

=  '£(j+l)(2FJ+1-FJ)z’ 
j-  0 

1  OC  n  OC 

F2 (z)  =  gX>'  +  l)(2^'+i  -  Fj)z*  -tT  F^F 


j=0 

OC 


j= 0 


^2(z)  =S((i+1)(2f;+i  -  *})  -  2FJ+1)^. 


5f=o 


The  coefficient  of  the  2"  term  of  F2(z)  is: 

2nFn+i  -  (n  +  1)F„ 


The  self-convolution  of  the  Fibonacci  sequence  is: 


2nF„+i  -  (n  +  1)F„ 


fc=0 


Appendix  E 

Properties  of  the  Greedy  Fibonacci 
Representation 

We  now  demonstrate  several  salient  properties  of  the 
greedy  Fibonacci  representation.  If  D  =  where 

<?j  €  {0, 1},  we  express  D  as  a  =  <rn<7n2i  •  •  ■  crj .  The  n-digit 
greedy  representation  o  =  er„  •  •  ■  ax  is  referred  to  as  an 
n-string. 

Property  A.  =  F{ri+1).kFk  fori  <  k  <  n,  where  refers  to 
the  number  of  n-strings  in  which  the  kth  digit  is  1 . 

Proof.  Let  1  <  k  <  n.  The  fact  that  crk  —  1  implies  that  ak+\  = 
0  and  ak~\  =  0  since  no  consecutive  Is  are  possible  in  a 
greedy  Fibonacci  representation.  Then,  a  =  aOlO/3,  where 
a  and  0  are  an  (n  -  k  —  l)-string  and  a  (k  -  2)-string, 
respectively.  This  suggests  that  Th)  =  iVn_A._iiVA.„2/  where 
Nj  is  the  number  of  possible  ^-strings.  Since  all  numbers 
from  0  to  Fj+ 2  -  1  can  be  represented  by  a  j-string, 
Nj  =  Fj+2;  therefore,  =  Fn„k+1Fk  for  1  <  k  <  n.  When 
k  =  1,  a  =  7OI,  where  7  is  an  (n  -  2)-string.  This  implies 
that  Tin  =  Nn- 2  =  Fn.  Similarly,  Tnr}  =  7V„_2  =  F„.  Since 
F„  =  F„_n+iFn  and  Fn  —  Fn_i+iFi,  =  Fr,-k+iFk  for  all 
/c  such  that  1  <  k  <n.  □ 
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Fig.  21 .  Cell  loss  rates  for  the  cases  where  all  permutations  and  two 
permutations,  respectively,  of  a  fiber  sequence  are  examined  and  where 
p  =  0.8  and  r  =  8. 


Property  B.  Tn  =  /  -where  Tr,  is  the  total 

number  of  Is  in  all  n-strings. 

Proof. 


Tr,  =  J2  rte  =  ]TF(n+1Ht 

k=i  k=l 


Fk. 


Since  —  Fq  —  0  when  k  —  n  +  1  and  Fk  —  0  when 

k  =  0,  T„  -  F„+i-kF>.-.  From  Appendix  E, 

E"-=o  F»-kFk  =  This  implies  that 


T„ 


77+1 

=  ^F(n+i)_fcFt  = 

j= o 


2(n  +  1)F,h.2  -  (n  +  2)Fn+i 
5 


□ 


Property  C.  Xri  = 


2(77+1  -(7t+2)F„-t  _ l_ 


where  Xv  is  the 


average  number  of  Is  in  a  nonzero  n-string. 

Proof.  Xn  is  the  ratio  of  the  sum  of  the  total  number  of  Is  in 
all  ^-strings  and  the  number  of  nonzero  M-strings,  i.e., 
X7I  =  .  The  assertion  follows  from  Property  B.  □ 

Property  D.  F„=^  =  where  E„ 

is  the  probability  that  a  digit  in  an  n-string  contains  a  1. 


Proof.  Implied  in  Property  D  is  that  the  n-string  is  chosen 
uniformly  among  all  possible  nonzero  n-strings.  Ev  is  the 
ratio  of  the  average  number  of  Is  in  a  nonzero  n-string 
and  the  number  of  digits  in  an  n-string,  i.e.,  which  is 
the  assertion.  □ 

Property  E. 


,  „  2 

lim  En  =  - 

71 — *CC  5 


11 


Proof.  Since 


p  _  2(n  +  l)F„.o 
~  on(y.,~  1) 


(t»+2)F„-i 


as 


n 


oo, 


F„+ 2  -  1  «  jF„+ o.  Since  F„  =  +  il/  lim„-3c  Fn  = 

^ t>T‘  and  lim7,_x  jfy  —  This  implies  that 


lim  E„  =  |  0.2763932. 

77— ‘C5C  5 

□ 


Appendix  F 

Effect  of  Searching  All  Permutations 

We  now  demonstrate  that  searching  for  all  permutations  of 
a  fiber  sequence,  in  general,  does  not  appreciably  decrease 
the  cell  loss  rate  of  the  architecture.  As  an  example,  consider 
the  following  two  simulation  studies  with  r  =  8  and 
p  —  0.8.  In  one  of  the  simulation  studies,  the  algorithm 
searched  only  the  two  permutations  of  a  fiber  sequence,  as 
described  in  Section  4.1;  in  the  other  simulation  study,  the 
algorithm  examined  all  possible  permutations.  As  indicated 
in  Fig.  21,  the  performance  of  the  two  studies  is  nearly 
identical.  Searching  for  only  two  permutations  suffices  to 
achieve  nearly  the  same  level  of  performance  as  that  of 
exhaustive  searching  and,  therefore,  is  preferable  as  it 
results  in  significantly  lower  computational  complexity. 
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